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Abstract 

Three linear controllers are designed to regulate the end effector of the Space Shuttle Remote 
Manipulator System (SRMS) operating in Position Hold Mode. In this mode of operation, jet 
firings of the Orbiter can be treated as disturbances while the controller tries to keep the 
end effector stationary in an Orbiter-fixed reference frame. The three design techniques used 
include: the Linear Quadratic Regulator (LQR), H? optimization, and optimization. 

The nonlinear SRMS is linearized by modelling the effects of the significant nonlinearities 
as uncertain parameters. Each regulator design is evaluated for robust stability in light of the 
parametric uncertainties using both the small gain theorem with an Hoc norm and the less 
conservative /i-analysis test. 

All three regulator designs offer significant improvement over the current system on the 
nominal plant. Unfortunately, even after dropping performance requirements and designing 
exclusively for robust stability, robust stability cannot be achieved. The SRMS suffers from 
lightly damped poles with real parametric uncertainties. Such a system renders the /i-analysis 
test, which allows for complex perturbations, too conservative. 
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Chapter 1 


Introduction 


Future Space Shuttle operations include the on-orbit assembly of the Space Station Freedom 
(SSF). This assembly process will require the Shuttle Remote Manipulator System (SRMS) to 
manipulate many separate elements of the SSF toward its ultimate construction. Experience in 
SRMS payload operations gained over the last decade indicate deficiencies in current operations. 
Some of these deficiencies are of particular concern for the assembly of the SSF. One such 
concern is the long durations of time required to damp vibrations of the SRMS and attached 
payload after a given maneuver. Attitude thruster firings of the Shuttle Flight Control System 
(SFCS) during SRMS operations is another source of undesired excitations. The time spent 
waiting for these undesired vibrations to damp out can consume a significant fraction of the 
total mission time. For example, the estimated time spent waiting for the SRMS to damp during 
the first sixteen Shuttle flights required for the SSF assembly task is illustrated in Figure 1.1 [1]. 

The SRMS consists of six joints connected via tubular structural members with a grapple 
at the end effector for grasping payloads. Some of these structural members are longer than 
6 meters. The total length of the extended arm is over 15 meters, making for a very flexible 
structure. Control of the arm is performed by six motor rate feedback servos, one for each joint. 
In order to perform end effector or payload maneuvers, the SRMS has five modes of operation. 
These modes include the Automatic Mode, the Manual Augmented Mode, the Single Joint 
Mode, the Direct Drive Mode, and the Backup Drive Mode. The Direct Drive Mode and the 
Backup Drive Mode are both contingency modes that allow the SRMS to operate even if certain 
system failures occur. After the SRMS completes a maneuver in one of the three regular modes, 
each servo is initially given a zero rate command. After the zero rate command state, the SRMS 
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Flight number 

Figure 1.1. Cumulative SRMS Damping Time 

drops into Position Hold. In Position Hold, as the name implies, the system tries to maintain 
its final end-point position and attitude. The design philosophy for the SRMS is discussed in 
detail by Ravindran [2]. 

Currently, the SRMS maintains Position Hold end-point control using the joint control 
servos. In Position Hold, each joint is equipped with a controller that supplies joint rate 
commands based upon angle feedback from joint encoders. However, since boom flexibility 
is not observable in the joint angle feedback measurements, no active vibration suppression 
currently exists for control of the SRMS and attached payload. Simple movements of the SRMS 
with a heavy payload attached can cause vibration of the arm with frequencies inside the control 
bandwidth. An even larger source of excitation is Shuttle attitude control thruster firings that 
are required to support payload operation constraints such as: lighting, contamination, and 
release or tip-off requirements. The system must wait for these vibrations to die out since it is 
not actively damping them. Current control system design technology offers the opportunity 
to design controllers that will actively damp such undesired vibrations. The key is to provide 
such controllers with information about the structural flexibility of the SRMS. 

Gilbert, et al. [1] investigated active damping of the SRMS using a Linear Quadratic Gaus- 
sian (LQG) controller. Gilbert approached the problem by replacing the zero rate command 
state with a LQG controller after the SRMS finishes a maneuver. Assuming linear accelerom- 
eters on the end effector, this controller damps the joint rate commands to a specified level 
before falling into Position Hold. Gilbert has applied this technique to the Single Joint and 
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Manual Augmented Modes of operation. 

Engineering simulations have shown that Position Hold itself can be unstable, particularly 
with heavy payloads associated with the SSF assembly task. There are several advantages in 
providing compensation for the current Position Hold mode. Since Position Hold is universal to 
the three standard control modes, improving its performance improves the SRMS over its full 
range of normal operations. Furthermore, the joint deflections are relatively small in Position 
Hold. Therefore, the nonlinear plant associated with a six degree-of-freedom robotic manip- 
ulator is operating in a fairly linear region. Significant nonlinearities come from the servos 
rather than the kinematics associated with the geometry. The servo nonlinearities are easier to 
characterize using linear control design techniques. 

This thesis provides a survey of several linear control design techniques as applied to vibra- 
tion damping and disturbance rejection of the SRMS while operating in Position Hold. Rather 
than damping the joint angles, the end effector position and attitude are damped since reduc- 
ing oscillations of the payload itself is more crucial. Design methodologies examined include: 
Linear Quadratic Regulator (LQR) synthesis, Linear Quadratic Gaussian or H<i optimization, 
and Hqq optimization derived compensation. Designs are compared with the current SRMS 
Position Hold controller in both the frequency and time domains. Regulators are designed to 
maintain the SRMS end effector position and attitude relative to an Orbiter-fixed reference 
frame while being subjected to Shuttle thruster firings. 

There are several significant obstacles to overcome before linear control design techniques 
can be applied to this problem. First, the nonlinear system must be linearized. Because the arm 
is essentially in a fixed configuration during Position Hold, the six servos of the SRMS are the 
main sources of nonlinearities. These nonlinearities include: current and torque limiters, trim 
integrators, gearbox stiffness, digital and analog tachometers, and joint motor stiction/frictions. 
The effect of these nonlinearities must either be deemed insignificant, or somehow linearized 
to a nominal value. One method to model the effects of the significant nonlinearities on the 
linearized system is to represent them as parameter uncertainties. In this manner, it is possible 
to perform robust stability analysis via ^-analysis on the linear plant. This analysis will help 
indicate if the system is stable in face of the nonlinearities present in the system. 

These nonlinearities that are modelled as parametric uncertainties pose a significant chal- 
lenge to any linear controller design. The current Position Hold system is not robustly stable 
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with these nonlinearities as seen by its limit cycle behavior. In this thesis, each of the three 
design methodologies, LQR, and H OQi offer significant performance improvement over the 
current system. However, none of the designs achieves robust stability. Because the uncertainty 
of the system is parametric, not unmodeled dynamics, a simple reduction of the bandwidth will 
not achieve robust stability. The outline of this thesis is as follows: 

Chapter 2 

This Chapter describes in detail the Shuttle Remote Manipulator System. The Position 
Hold mode is the focus of discussion to illuminate the advantages in improving its performance. 
Aside from a physical description of the SRMS, the five control modes are explored in depth: 
the Automatic Mode, the Manual Augmented Mode, the Single Joint Mode, the Direct Drive 
Mode, and the Backup Drive Mode. 

Chapter 3 

This Chapter presents the development of the SRMS model. The equations of motion are 
derived for a simple one joint case and then extended to six joints. A nonlinear servo model is 
presented and then linearized by eliminating those effects that are negligible. Two nonlinearities 
that have significant contribution are treated as nominal parameters with uncertainty. The 
system is placed in a generalized three-block form. Time domain simulations of both the linear 
model and a higher fidelity nonlinear model of the SRMS are compared to verify the linear 
model. 

Chapter 4 

This Chapter starts out by examining the characteristics of the open-loop design plant. 
Model reduction is applied since six poles are very fast relative to the rest of the system. This 
reduced order model is used to design new regulators in subsequent Chapters. The performance 
and control effort criteria are presented that the subsequent regulator designs will be judged 
by. As a baseline for comparison, the performance of the current system is analyzed. Two 
techniques for robust stability analysis, the small gain test and p- analysis, are presented as 
well. These techniques are applied to the current Position Hold controller to illustrate the 
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process as well as explore the characteristics of the current regulator. The shortfalls of these 
techniques when applied to real parametric uncertainty are explored as well. 

Chapter 5 

In this Chapter, the first new regulator is designed using the Linear Quadratic Regulator 
(LQR) methodology. This design requires full-state feedback uncorrupted by sensor noise. The 
basic theory as well as the design process is explored. The resulting regulator’s performance is 
compared to the current system. Robust stability analysis is performed using the techniques 
developed in Chapter 4 . 

Chapter 6 

An #2 optimal compensator is developed in this Chapter. The H 2 theory is presented as 
well. This model-based compensator does not require full-state feedback. The H2 regulator 
relies on state estimates obtained from the noisy measurements of the system to minimize the 
H2 norm of a transfer function. As before, the resulting regulator’s performance is compared to 
the current Position Hold. Robust stability analysis is performed using the techniques developed 
in Chapter 4 . 

Chapter 7 

In this Chapter, an H 0 0 optimal compensator is developed. Both the theory and the design 
process parallel the H2 design of Chapter 6 . The //<*> regulator minimizes the H norm of a 
transfer function. The H regulator is compared to both the current system and the H2 system 
to help highlight the difference in the choice of norms. Robust stability is explored as before. 
In an effort to obtain a robustly stable compensator, a different transfer function whose norm 
is minimized is selected. In this manner, it is possible to design for robust stability directly. 

Chapter 8 

This Chapter offers a summary of the conclusions reached during the development of this 
thesis and recommendations for further work. 
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Chapter 2 


Description of the Shuttle Remote 
Manipulator System 


The Shuttle Remote Manipulator System (SRMS) is an integral subsystem of the Payload 
Deployment and Retrieval System (PDRS) on board the Space Shuttle. The SRMS is used 
primarily for the deployment of payloads from the Orbiter cargo bay, as well as retrieving 
payloads from orbit and stowing them in the cargo bay. Payloads of up to 30,000 kg mass with 
dimensions of up to 18.3 meters in length and 4.3 meters in diameter can be handled by the 
system from up to a 15 meter distance in space. Other applications supported by the SRMS 
include: crew extravehicular activities (EVA); inspection, servicing and repair of spacecraft; 
transfer of men and equipment; as well as the on-orbit assembly of the Space Station Freedom 
(SSF) [2], 

The SRMS is operated by a mission specialist from the aft port window location of the crew 
compartment in the Orbiter. The SRMS operator uses a dedicated control system with the aid 
of direct viewing as well as a closed circuit television. Television cameras are located in the 
cargo bay as well as being mounted on the arm itself. 

SPAR Aerospace Limited in Toronto in conjunction with a team of Canadian companies 
performed design, development, testing, and manufacture of the SRMS. All of the work was 
performed under a contract from the National Research Council of Canada (NRCC) working 
under the auspices of NASA [2]. 

This Chapter describes the SRMS system in detail so that the motivation for design choices 
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Figure 2.1. Shuttle Remote Manipulator System 

can be understood. Section 2.1 details the physical structure of the SRMS. Physical limitations 
of the arm are also discussed. Section 2.2 explains the five control modes of the SRMS. Of these 
five control modes, three are for routine Shuttle operations and the remaining two are backup 
systems. Section 2.3 concentrates on Position Hold and points out its shortcomings. 


2.1 Physical Description 

The SRMS is comprised of a six degree-of-freedom controlled anthropomorphic man-machine 
system. This system includes an approximately 15 meter long manipulator arm attached via 
a swing-out joint to the port longeron of the Shuttle Orbiter cargo bay, a control and display 
system, a controller interface unit between the manipulator and the Orbiter computers, as well 
as the closed circuit televisions and lighting systems [3]. A general diagram of the SRMS is 
shown in Figure 2.1 [2]. 

The mechanical arm assembly is shown in its stowed position in Figure 2.2. The arm is 
comprised of six joints connected via structural elements providing a hemisphere of reach for the 
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Figure 2.2. Mechanical Arm Assembly 

end effector. The joint sequence is as follows: the shoulder yaw and shoulder pitch joints near 
the arm attachment point; the elbow pitch joint at the arm midsection; and finally three joints, 
wrist pitch, yaw and roll, near the arm tip. Three dimensional translation of the end effector 
is provided by the shoulder and elbow joints, whereas the three wrist joints are responsible for 
its three dimensional orientation [4]. The upper boom of the arm, which is 6.4 meters long, 
connects the shoulder and elbow joint. Next, the 7 meter lower arm boom connects the elbow 
joint to the wrist pitch joint. The end effector is attached to the wrist roll joint. The arm 
booms are made from a graphite/epoxy composite material. They have a thin-walled tubular 
cross section with internal stiffening rings [3]. 

The electro-mechanical servos of each joint are similar to each other except for the gear 
trains. The gear trains vary in their gear ratios with a maximum of 1842:1 for the shoulder 
joint. All gear trains are designed to provide both forward and backward drive capabilities. 
Backdrive occurs when a joint is driven by an external torque applied to the output of the 
gearbox. Backdrive is necessary to limit the loads which can be applied to each joint. 

Each servo is comprised of a motor module and a high efficiency, low speed epicyclic gear 
train. Each motor module has a brushless DC permanent magnet motor, a primary and backup 
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optical commutator, a tachometer, and an electro-mechanical brake. Angle position feedback 
is provided from each joint via optical position encoders [2]. 

The end effector is attached to the wrist roll joint. The end effector is comprised of three 
snare wires allowing it to grapple a payload and keep it rigidly attached to the Orbiter, or to 
deploy a payload. Capture or release of a payload is controlled by the arm operator who can 
view the grapple fixture through a wrist camera [5]. 

2.2 SRMS Control Modes 

There are five control modes of operation for the SRMS; three standard and two back-up 
modes. The purpose of the SRMS control algorithms is to convert various inputs from several 
possible sources into rate commands for the six joint servos. The servo rates are the only 
control input of the system. Currently the only feedback to the control algorithms, besides the 
operator in the loop, are the joint angles from the high precision position encoders in each joint. 
Typically the operator desires to control the end effector or some point of resolution (POR) 
on the payload; either rotationally, translationally, or a combination of both. Inputs to the 
control algorithms could come from either hand controller inputs made by the arm operator, 
or an end effector auto sequence trajectory stored in the Orbiter’s general purpose computers 
(GPC) before flight [2]. 

The five modes of operation of the SRMS are: 

1. Manual Augmented Mode 

2. Automatic Mode 

• operator commanded 

• preprogrammed auto sequence 

3. Single Joint Drive Mode 

4. Direct Drive Mode 

5. Backup Drive Mode 

The Manual Augmented Mode includes the operator in the control loop; this is the normal 
mode of operation of the arm. The operator commands the end effector (or POR of the payload) 
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translational and rotational velocity using the two, three-degree-of-freedom hand controllers. 
Commands from the hand controllers are resolved into joint rate commands by the SRMS 
computers. The operator has a choice of four different coordinate operating systems [2]. In this 
mode of operation, all joints can be commanded simultaneously. 

Two types of Automatic Modes are available in the SRMS. In the operator commanded 
auto sequence, the arm is driven along a ‘straight line’ to coordinates input to the GPC by 
the operator during flight. During a preprogrammed auto sequence, the operator can select 
up to four of 20 preprogrammed trajectories to execute at one time. These preprogrammed 
trajectories, which are verified before flight, are composed of up to 200 straight line elements 
stored in the GPC. The operator can select any point along the straight line elements as a 
pausing point for the arm until he or she requests it to complete the trajectory [3]. 

In the Single Joint Drive Mode the operator can control an individual joint with the full 
support of the GPC. The operator selects the joint and provides a fixed drive signal. The GPC 
then interprets the appropriate rate commands. The remaining joints are kept in a Position 
Hold status as described in Section 2.3 [3]. 

Direct Drive Mode is the first contingency mode of the SRMS. As in the Single Joint Mode, 
the operator may command only one joint at a time. In this mode, the remaining joints are 
deactivated and held in position by their brakes. Commands to the working joint are hardwired 
to the Motor Drive Amplifiers (MDA). Commands bypass the GPC, data buses, and the servo 
control loop [2]. 

The Backup Drive Mode is an additional contingency mode provided to help insure the 
SRMS is fail-safe. Similar to the Direct Drive Mode, only one joint may be controlled at a 
time. It is used when no primary drive channels are available. It bypasses all primary channel 
electronics, including the GPC, and uses a separate Backup Drive Amplifier (BDA) and backup 
motor commutator driven by a separate power source [3]. 

2.3 Position Hold 

After completion of an SRMS maneuver in one of the three normal modes of operation, 
each control servo is given a zero rate command for a short period of time (determined by 
the payload/arm configuration). After the zero rate command, each servo drops into Position 
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Figure 2.3. Typical SRMS Maneuver Profile 

Hold Mode. This zero rate command period is the window in which the controllers designed 
by Gilbert, et al.[l] reside. It is designed to damp end effector velocities as quickly as possible. 
A typical SRMS maneuver time profile is shown in Figure 2.3. 

As the name implies, the purpose of the Position Hold Mode is too keep the arm in a fixed 
position. This mode is employed after a maneuver to give the SRMS time to settle and maintain 
a fixed position and attitude. At this point the operator either moves on to the next maneuver, 
or has reached the desired final position. This is actually a subtle point; between maneuvers of a 
given series of maneuvers the operator is not necessarily concerned with a particular end effector 
position and attitude, rather he or she would like all oscillations to stop before moving on to 
the next maneuver. However, after the final maneuver of a sequence the end effector position 
and attitude is the principal concern, especially in operations such as SSF assembly. For this 
reason, linear controller designs presented subsequently will regulate end effector position and 
attitude rather than its velocity. 

During Position Hold Mode, joint angle positions measured via the joint angle encoders 
are fed back to the servo control loops. It is important to realize that the joint angles are 
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converted into joint rate commands via proportional feedback only. With only the joint angles 
provided to the servo control loops, no information about flexibility in the manipulator booms 
is available. Currently the operator must wait for the arm structure to damp excitations in the 
booms resulting from residual energies after a maneuver. Furthermore, while the SRMS is in 
Position Hold, current guidelines allow Shuttle attitude reaction control system (RCS) firings. 
In such a situation, Position Hold should maintain the end effector in its current position and 
attitude relative to an Orbiter-fixed reference frame. However, Shuttle attitude jet firings excite 
considerable flex in the booms of the SRMS. Furthermore, when heavy payloads (>10K lbs) 
are deployed on the SRMS, this RCS induced flexure can result in closed-loop Shuttle flight 
control system instabilities. 

If it is desired to reduce the fraction of mission time squandered waiting for the SRMS to 
settle and to reduce dynamic interaction with other control systems, the benefits of focusing 
on improving Position Hold Mode are apparent. Position Hold Mode is universal to the three 
regular control modes; therefore, improving its performance improves the SRMS for its full 
range of normal operation. 
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Chapter 3 


Modelling the SRMS 


Modelling is a delicate task of representing the dynamics of the Orbiter/SRMS/Payload 
system which are important for the envelope of design while ignoring dynamics which are out- 
side the envelope. We are interested in improving SRMS performance while it is operating in its 
Position Hold Mode. Position Hold Mode is a regulator; one that desires to subdue initial con- 
ditions (residual energies from arm maneuvers), and reject disturbances (introduced by Shuttle 
attitude jet firings). In this mode, the arm is essentially in a fixed position; therefore, chang- 
ing mass and inertia matrices associated with changing arm geometries are not a significant 
factor. However, the six electro-mechanical control servos at the joints do contain substantial 
nonlinearities in their dynamics which cannot be ignored. 

The approach taken to modelling the SRMS is to divide the system into two subsystems. 
First, the dynamics of the Orbiter/SRMS/Payload (arm) subsystem are considered without the 
dynamics of the servos. Second, the control servo dynamics are modelled. A block diagram 
depicts the total system in Figure 3.1. 

Inputs to the ‘arm dynamics’ subsystem include Shuttle attitude thruster firings, sensor 
noise, as well as the torques on each joint generated by their corresponding control servo. 
These inputs are combined to determine the end effector acceleration. This acceleration is then 
integrated to obtain the outputs of this subsystem which are end effector position and attitude 
as well as the six joint angles. The joint angles are fed back through a proportional feedback 
gain matrix to the control servos. If we consider the joint angle and end effector perturbations 
to be small, which is the case in Position Hold, this subsystem is essentially linear. 

The control servo subsystem consists of six servos which have as inputs joint rate com- 
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servo uncertainties 



Figure 3.1. Block Diagram of the Compensated System 

mands from a designed controller as well as the joint angles determined by the arm dynamics. 
This subsystem has as outputs the torques generated on each joint. There are several nonlin- 
earities present in each servo, including: current and torque limiters, trim integrator limiter, 
gearbox stiffness, digital and analog tachometers, as well as joint motor stiction/frictions. The 
approach taken to linearize this nonlinear subsystem is to treat the nonlinearities as modelling 
uncertainties on a linear plant. 

Section 3.1 generates the equations of motion for the Orbiter/SRMS/Payload system. Equa- 
tions are generated in an Orbiter-fixed reference frame. The equations of motion are derived for 
a simplified one joint example and then extended to six joints. Assumptions used in modelling 
are also outlined. 

Section 3.2 begins with a nonlinear model of the joint servos and obtains a simplified linear 
model. This linear model is then combined with the arm plant in order to generate a linearized 
model of the complete SRMS. In order to linearize the control servos, certain nonlinearities 
are ignored which have minimal contribution in Position Hold Mode. The remaining nonlin- 
earities are linearized by selecting ‘nominal’ values, but their effects are modelled as parameter 
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uncertainties on the linear plant. The standard three block form of a transfer function used for 
design and robust analysis is introduced as well. The limitations of the methods used in this 
Section are also discussed. 

The resulting 24 state linearized model is compared with a higher fidelity nonlinear simu- 
lation in Section 3.3. This nonlinear simulation has been compared with actual flight data to 
confirm its accuracy. Comparisons are made in the time domain in Position Hold Mode. The 
effects of varying the parameter uncertainties of the linear plant in an effort to capture the 
range of the nonlinear behavior are also investigated. Section 3.4 draws the conclusions of this 
Chapter. The deficiencies of the major assumptions used in this Chapter are also re-explored. 


3.1 Equations of Motion for the Orbiter/SRMS /Payload Sys- 
tem 

Much of the derivation of the equations of motion in this section is based on the SRMS 
simulator LSAD 1 . LSAD is a higher fidelity nonlinear simulation of the SRMS and flight control 
system of the Space Shuttle. Details on LSAD and the basis of this section can be found in 
References [7], [8], and [9], 

In deriving the equations of motion, the mass of the connecting structure between the Or- 
biter and payload can be ignored when a payload mass exceeds 5,000 lbs. Previous simulations 
have found this assumption to be reasonable. LSAD as well is based on this assumption. Ref- 
erence [7] has the detailed derivation of the rotational dynamics between two bodies connected 
via a massless, flexible structure. Rather than derive the six joint case, Section 3. 1. 1 examines 
the one joint case and Section 3.1.2 extends it to six joints based on the work in Reference [7]. 

3.1.1 Equations of Motion for the Single Joint Case 

Figure 3.2 represents the case of a single joint in Position Hold. The SRMS h as been recast 
as a point mass, m, oscillating on the end of a massless flexible beam which is attached to a 
single joint. A joint in Position Hold behaves as a spring with some damping. Table 3.1 defines 
the essential variables. 

•LSAD stands for Less Singing Ami Dancing; a take off of Spar Aerospace’s ASAD: AH Singing And Dancing. 


27 




Table 3.1. Variable Notation for One Joint EOM 


Variable 

Description 

P 

force acting on the beam 

r ; 

torque acting at the joint 

w 

displacement of the beam tip 

Wf 

deflection of the beam tip due to flexure 

W J 

deflection of the beam tip due to joint angle 

a 

bending flexibility of the beam 

7 j 

joint angle 

L 

length of the beam 



Figure 3.2. One Joint Illustration: Point Mass on a Massless Flexible Beam 
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From these definitions 


w 

= Wf + Wj 

(3.1) 

W J 

= Ljj 

(3.2) 

Wf 

= ap 

(3.3) 

T i 

= Lp 

(3.4) 


Therefore 


= Ljj +ap= Lyj + a-j- 


(3.5) 


From Newton’s second law, in the absence of external forces, the force at the tip, or the 
torque over the length, is equal to the acceleration of the mass 


p — —mw 

When combined with Equation 3.4, this can be recast as 


(3.6) 


w = — 


1 

mLs 2 Tj 


(3.7) 


Combining Equation 3.7 with 3.5 obtains 


(3 - 8) 

Equations 3.7 and 3.8 together achieve the objectives of determining the end effector posi- 
tion, u>, (position only for the one joint case) and the joint angle position, 7 j, given a torque 
input to the system, Tj . These two equations can be rewritten as 


w = — s 2 m 1 L 1 Tj 


7j = - 


r 2 L-'m~'L-' +L-'aL-'\ r, 


(3.9) 

(3.10) 


Though this representation seems notationally verbose, it will help motivate the six joint 
case. The extension of this problem to six joints involves more complex notation, but essentially 
the same concepts. 
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3.1.2 Equations of Motion for the Six Joint Case 


In extending this problem to six joints and three dimensions, it is necessary to introduce 
vector and matrix notation to keep track of positions and rotations. Furthermore, we will add 
the effects of external forces caused by Shuttle attitude thruster firings to the system. 

As before, u7, will define the state of the end effector in an Orbiter-fixed reference frame. 
Now, however, in three dimensions it is composed of a displacement vector and a rotation vector 



(3.11) 


where q is the vector defining the translational deflection of the end effector due to the applied 
load. Similarly, 6 is the vector defining the angular deflection of the end effector on each axis 
due to the applied load. Since Position Hold Mode entails small variations about equilibrium, 
rotations can be described by vectors. Table 3.2 makes similar definitions as before. 


Table 3.2. Variable Notation for Six Joint EOM 


Variable 

Description 

P 

applied force from the payload at the end effector 

r 

applied moment from the payload at the end effector 


vector of servo torques at each joint 

w 

end effector state vector 

Wf 

end effector deflection due to flexibility of the arm 

W i 

end effector deflection due to rotations of the joints 

A 

6 by 6 flexibility matrix 

7 

joint angle vector 

L 

6 by 6 Jacobian matrix 


As before, w may be decomposed as follows 


W = Wj + Wf 


( 3 . 12 ) 
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where 


Wj = L-y 

and 


(3.13) 



The six joint analog of Equation 3.4 is 


(3.14) 



T 


where Ty is a vector of the servo torques at each joint, 
follows 


(3.15) 


This expression can be inverted as 


Combining Equation 


L 1 Ty = 


3.12 with 3.16, 3.13, and 3.14 gives 


(3.16) 


w = Lj 4* AL * Ty (3.17) 

The six joint version of Equation 3.6 with the effects of external forces added is 

P 

T 

where M is the system mass matrix and ud is the vector representing the effect of control 
torques from Shuttle attitude thruster firings. Combining Equations 3.16 and 3.18 obtains 


= —s 2 Mw + tiy; 


(3.18) 


w = — s 2 M 1 L / Ty + s *ud (3.19) 

Placing Equation 3.19 into Equation 3.17 results in 

7 = — \i s~ 2 L~ 1 M-'L~ r + ZrMZ,- 7 '] + s - 2 L~ 1 M" 1 u l} (3.20) 

Equations 3.19 and 3.20 are the six joint analogs of Equations 3.9 and 3.10. These together 
define the equations of motion for the Orbiter/S RMS/Payload subsystem. Given the torques 
produced by the control servos, and disturbances caused by Shuttle attitude thruster firings, 
the state vector defining the end effector position and attitude as well as the joint angle vector 
are determined by the above system of equations. 
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3.2 Modelling and Linearization of the Control Servos 


The general approach of this Section is to characterize the transfer function between the joint 
angles, 7, and the joint rate commands, y c} to the torque output at the joints, r 7 . Section 3.2.1 
linearizes this nonlinear system, while Section 3.2.2 characterizes the effect of significant non- 
linearities. 

3,2.1 Control Servo Dynamics 

The highest fidelity model of the SRMS servos available at Draper Laboratory is shown in 
Figure 3.3. This model was designed to be accurate over a large frequency range and a wide 
envelope of operations. This model is faithful enough to even encompass the accelerations of 
the motor shaft while it transitions from forward driving to back driving 2 using simulation time 
steps of 2 milliseconds [8]. References [8] and [10] show that for the low frequency oscillations 
that occur with heavy payloads, much of Figure 3.3 can be eliminated. 

References [8] and [10] both derive linearized models of the servo, but for slightly different 
purposes. They do, however, provide useful insight into the nature of the nonlinearities as well 
as the conditions in which they can be ignored. Figure 3.4 represents the linearized model of 
the servos. Table 3.3 defines the variables of Figure 3.4. All quantities are either vectors or 
matrices since there are six servos. The servos are all similar in configuration; differences occur 
only in the values of the gear ratios, current limiters, moments of inertias, and gearbox stiffness. 

Heavy payloads are assumed in modelling the arm dynamics. Draper’s experience with the 
SRMS in Position Hold mode demonstrates these conditions imply low frequency motions with 
moderate amplitudes. This implies we can neglect high frequency dynamics. In linearization, 
the effects of gear backlash, stictions, and frictions are ignored since they have negligible effect. 
Simulations have shown that the integral trimmer has an effect only on motion when friction 
is important; therefore, the trimmer and its limiter are eliminated from the linear model. The 
analog tachometer with its high pass filter has negligible effects on low frequency motions, so it is 
eliminated as well [8]. The digital tachometer’s quantization effects are ignored; it is treated as 
a linear gain. While in Position Hold, the current limiter saturates before the torque limiter so 
the torque limiter can be eliminated from the model. Indeed, the most significant nonlinearities 

^Forward drive occurs when the motor is driving the payload. Back drive occurs when the payload is driving the 
motor. 
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Table 3.3. Variable Notation for Servo Parameters 


Variable 

Description 

Units 

7 

Joint Angle Vector 

radians 


Vector of Torque at each Joint 

ft-ib 

ic 

Joint Rate Commands 

rad/sec 


Motor Shaft Angle Vector 

radians 

% 

Motor Shaft Rate Vector 

rad/sec 

N 

Gear Ratio 

- 


Motor Speed Command Scaling Gain 

count /rad/sec 

K da 

Digital-to-Analog Converter Gain 

v/cnt 

K a 

Motor Drive Amplifier Gain 

v/v 

Kr/Ri. 

Motor Torquing Gain 

ft-lb/ v 

K n 

Effective Gain of Digital Tachometer Processing 

cnt/rad/sec 

Kh 

Motor Back EMF Constant 

v/ rad /sec 

Km a 

Gain of the Forward Path: K daK aK\'I R f, 

ft-lb/cnt 

Kh'H 

Gain of the Feedback Path: Kn - f- Kh/KdaKa 

cnt/rad/sec 

J M 

Motor Rotor and Gearbox Input Shaft Moment of Inertia 

slug-ft 2 

Ka 

Gearbox Stiffness 

ft-lb /rad 
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while operating in Position Hold with a heavy payload are the motor current limiter and the 
gearbox stiffness curve. These cannot be dismissed as easily as the other nonlinearities. Ignoring 
their effects could lead to controller designs which do not achieve robust performance or stability. 

In an attempt to capture the effect of these nonlinearities, uncertainty blocks are used. 
Combining the gains in the forward path of the servo leads to a single gain, Kma. This gain 
includes the effect of the current limiter. Since it has the potential to saturate via its limiter, 
it is represented in Figure 3.4 with an uncertainty block, A, around it. Gearbox stiffness, K(;, 
with it’s nonlinear behavior has a A block around it as well. The mathematical characterization 
of these A blocks is discussed in the next Section. 

Figure 3.5 combines the results of the servo dynamics and the arm dynamics in block 
diagram form. This linearized version of the SRMS has been dubbed SSAD 3 . If not for the A 
blocks, it would be short work to put this linear plant model into state-space form 4 

x p = A p x p + B p u 

y = CpXp + DpU (3.21) 

where x is the plant state vector, y is the observation vector, and u is the control vector. 
Equation 3.21 represents the nominal state-space form of SSAD. 

For the development of regulators in subsequent Chapters, measurements of the end effector 
position and attitude for feedback are assumed. This selection of measurements provides the 
necessary observability of the structural flexibility existing in both the SRMS booms and the 
3 SSAD stand* for Some Singing And Dancing. 

4 From now on. bar* are left off vector* since all quantities are vector* or matrices unless specified otherwise. 
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gearboxes at each joint. There are several types of sensor suites that can recover this type of 
information, including force-torque sensors, strain gauges, or accelerometers. Eventually, the 
NASA/Johnson Space Center (JSC) plans to incorporate one of these set-ups into the SRMS 
system. 

3.2-2 Representing the Uncertainties 

The method of characterizing parametric uncertainty presented in this Section follows the 
work of Reference [11]. As shown in Figure 3.5, the state-space model of SSAD is a functioh 
of several physical parameters, two of which, I<a and Km A, are uncertain. Using the form of 
Equation 3.21 it is possible to model these parameter errors as 


= + £ AC, 6 ^ * p + (d p + Xj A DiS^j u 


(3.22) 


where each Si represents a parameter error that has been normalized as 


-1 < Si < 1 Vi 

As this suggests, the nominal value of both Ka and Km A must be bounded equally from 
both above and below. Consider Ka first. 

The gearbox stiffness curve is shown in Figure 3.6. The gain, Ka y is the slope of the curve 
at the point <f> — 7 . This slope can achieve values from zero up to a maximum slope that is 
determined by the servo under consideration. One choice that captures all possible values of 
Ka is to choose the nominal value as half the maximum attainable slope with bounds above and 
below of also half the maximum slope. This, however, will introduce problems when performing 
robust analysis. Allowing a zero value for Ka is essentially allowing for the possibility of the 
servos to be cut off from the rest of the system. All of the servos’ associated dynamics would 
become both uncontrollable and unobservable to any controller. Such a system will always fail 
robust stability analysis. This, however, is a mathematical fiction. The system’s manifestation 
in a zero Ka state is transient, not a situation which the controller must deal with for any 
length of time. A gain of 0.01 is used as the minimum value of Ka in order to pose the bounds 
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Figure 3.6. Gearbox Stiffness Curve 

in a better conditioned manner. For a value of Ka below 0.01 we are assuming that the system 
is either in a state of transition or in a relatively unexcited state. Ka is selected in the 
center of this range. 

The current limiter, shown in Figure 3.7, is a unit gain until it saturates. After saturation, 
the limiter’s behavior can be characterized as a gain that is inversely proportional to the input 
current. The minimum gain achievable, K m i n , is the current limit value divided by the maxi- 
mum current possible in the system, both known quantities for each servo. Since the current 
limiter has been absorbed into Kma, a will have a range from K m inK m a up to Km a* 
Km Ana™ * s a l so selected in the center of this range. 

There are now twelve parameters, six Ka and six Km a, nominally centered in ranges, but 
free to vary independently of each other. To calculate every possible AA*, A B*, AC* and AD, 
these parameters would generate through their ranges would be computationally intensive. As 
a first order approximation, A A*, A B*, A C{ and AD* are computed by sending each parameter 
individually to its upper and lower bound to reduce computation. This will generate a set of 
24 perturbed matrices. 

The system can now be written in the perturbed state-space form of Equation 3.22. The 
perturbation matrices associated with each uncertain parameter can be combined into a single 
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Figure 3.7. Current Limiter 


Ni = 


g < J£( n *+ n y) x ( n *+ n «) 


(3.23) 


matrix 

r A A t A Bi 
AC, A Di 

where n x = dim(x), n v = dim (y), and n u = dim(u). This matrix is not full rank for the SSAD 
system since any one parameter does not affect all of the states and outputs. JVj can therefore 
be decomposed as 

Ni = Ql f 5. T, 1 (3.24) 

Ri J L J 

where for Rank(JV,) = n it Q, € ft n * xni , ft 6 S, € ft"’*"', 7) 6 ft"’*"”. The 

state-space model of the perturbed system can be rewritten as 


k 

I 

1=1 


Ap + 


= A p x p + [Q^ ... Q*] 


y = 


k 

i= 1 


Cp + X>,M„<S. 


X 7> + 

- Tlk 

X P + 


k 

Bp + J2Q iSiI * Ti 

i=i 


+ Bn« 


■=i 


(3.25) 
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— CpXp 4- [/2i ■ • . f?*] 


7i 


Ik 


+ 


(3.26) 




L 

' J 


<4 

= 

’ 5 , ' 

x p + 

T\ 

_ Ck m 


. . 


. Tk . 


(3.27) 


The 6, represent the inputs to the uncertainty block, A. They are a function of x p and u. 
The t)i represent the outputs of the uncertainty block which will influence the state dynamics. 
This models the effect the parameter uncertainties have on the plant dynamics. Define 


Q = [Qi . . . Q k ] 

R = [iii • • . Rk] 

s = [Si ... Si) 

T = [TT ... Tl) 

Augmenting the unperturbed state-space of Equation 3.21 with process noise, xv ) measure- 
ment noise, i/, and the effects of parameter uncertainty as defined above gives 


Xp — ApXp -j- BpU + B Uf w 4* Qt) 

y = CpXp + D p u + D u v 4- Rrj 

c = Sxp + Tu (3.28) 


Figure 3.8 represents the general transfer function of a nominal plant, P(s), and compen- 
sator, K(s), with parametric uncertainty, A. In this representation, d represents all exogenous 
inputs into the system; in this case 


d = 



(3.29) 


L " J 

and e is an error vector of signals of interest to be defined later. The state-space representation 
of Figure 3.8 is 
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Figure 3.8. Standard Form for a Compensated Plant with Uncertainty 


x = Ax + Bir] + B^d + B^u 
c = Ci x 4- £*n rj + D^id + 
e = C^x + Di i tj + Dud + 

y = C^x + D^i 7 ] ~h H- (3.30) 

Placing Equation 3.28 in the form of Equation 3.30 gives 


i p 


A p 

Q 

b 2 

B P ' 


x p 

€ 


s 

0 

0 

T 


n 

e 


c 2 

D 21 

D 22 

D 2.1 


d 

y 


Cp 

R 

Dm 

Dp . 


u 


Bi — [B w . 0( na .xn v )] 

Dn2 = [^(nyXii,,,) : By] 

Equation 3.31 represents the state-space equation for the linearized SRMS with the nonlin- 
earities Km A and Ka modeled as parameter uncertainties. One clarification needs to be made 
about the matrix B 2 . This matrix characterizes how the process noise affects the state. This 
matrix comes from the u/; term of equation 3.19. u/; represents the effect of Shuttle attitude 
thruster firings on the end effector. Recall that while in Position Hold mode the Shuttle is 
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restricted from using its translational thrusters, therefore u/; will effect only the rotational 
states. However, for computation of a linear optimal estimator such as one employed by model 
based compensators, every state must be controllable by the noise. To avoid problems before 
they occur, the effects of fictitious Shuttle translational thruster firings are added to £ 2 . 

The actual construction of u/; depends upon the Orbiter/Payload configuration. On a given 
Shuttle mission, pre-flight evaluations select a certain set of jets to fire for a roll, pitch or yaw 
command for a given Orbiter/Payload configuration. Barring a system failure, those same jets 
are used every time. Since the force a particular jet will impart to the end effector can be 
calculated, the disturbance vector, 1 1 /;, has six elements corresponding to a roll, pitch, yaw, 
x translation, y translation, or z translation command. Such a command is normalized to lie 
between -1 and 1 corresponding to, for example, a -roll command or a -(-roll command. 


3.3 Comparison of the Linear and Nonlinear Plant 

There are several simplifications made in developing the linearized SSAD from the nonlinear 
LSAD. This Section compares the two simulations to expose the deficiencies in the linearization 
process. Frequency domain information such as singular value decompositions are not available 
for the LSAD. Instead, comparisons are made in the time domain while running both systems 
in Position Hold Mode. Figure 3.9 shows a block diagram of SSAD in Position Hold Mode. 
The Position Hold controller does nothing more than feed back the joint angles, 7 , to the servos 
through a set of gains, Ky • 

The Hubble Space Telescope (HST) in its release configuration is selected as the payload on 
a standard Orbiter. HST weighs 25,000 lbs, so the heavy payload assumption is not violated. 
HST in its release configuration will be used in subsequent Chapters for all designs since when 
this research began, ample data was available on this payload in this configuration. 

Recall that both Km a and Ka have a range of values. The nominal state-space, P nam (s ), 
places both Km a and Ka at the midpoint of their respective ranges. Since P n om( s ) is the 
plant that will be used in controller design, P n om(s) in Position Hold Mode is compared with 
LSAD. 

Both SSAD and LSAD are run in a Position Hold closed-loop with initial conditions on 
end effector velocity, w. This scenario models the situation occuring when the SRMS drops 
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servo dynamics arm dynamics 



commands 


Figure 3.9* SSAD in Position Hold 


into Position Hold Mode following a maneuver; the SRMS is instructed to maintain it’s current 
position and attitude though residual velocities remain from the maneuver. Recall that the end 
effector state vector is constructed as follows 


(3.32) 


X position " 

Y position 
q Z position 

6] Roll attitude 

Pitch attitude 
L Yaw attitude J 

Figures 3.10 and 3.11 compare the translational and rotational response of the end effector 
to an initial velocity for both LSAD and the nominal SSAD. Both amplitudes and frequencies 
are similar. Considering the linearization process, SSAD compares reasonably well with LSAD. 

Recall that Figures 3.10 and 3.11 represent the nominal SSAD plant. In the nominal plant, 
both K\ia and Ka have been placed in the center of their ranges for design and robust anal- 
ysis purposes. Hopefully, the uncertainty ranges on these gains capture the behavior of the 
nonlinear plant. Following this logic, there should exist off-nominal values for both K m a and 
K ( ; that will make the comparison between LSAD and SSAD more favorable for these initial 
conditions. Looking at Figures 3.10 and 3.11 it seems that SSAD is damping quicker than 
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End Effector X Position 



Figure 3,10. Translation Response of LSAD and Nominal SSAD to an Initial Velocity 
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End Effector Attitude: Roll 



Figure 3.11. Rotational Response of LSAD and Nominal SSAD to an Initial Velocity 
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LSAD. After moving K(; towards the lower end of its range to compensate this damping, and a 
little experimentation with increasing K \i a as well, Figures 3.12 and 3.13 present an improved 
match between LSAD and SSAD. 

3.4 Conclusions 

This Chapter uses several assumptions to bring the SUMS Position Hold plant into a 
workable linear form. SSAD model accuracy is restricted to payloads greater than 5,000 lbs. 
This is not a crippling assumption. The 25,000 lbs Hubble Space Telescope is used for design in 
this thesis. Other assumptions are made in linearizing the nonlinear plant. Several nonlinearities 
are ignored whose effects are small in the envelope of design. That is, they either have negligible 
contribution, or dominate at high frequencies. Two significant nonlinearities, Km a and K<; y 
are represented as parameter uncertainties. These steps are all necessary to obtain a workable 
plant model such that linear control design techniques can be applied. The drawback, however, 
is that even if robust stability can be guaranteed on SSAD, this guarantee will not translate 
to the full nonlinear plant. SSAD can provide a reasonable indication, but not an assurance of 
robust stability. The standard three block form for design and robust analysis presented in this 
Chapter will be exploited in subsequent Chapters to provide easy ways to compare controllers 
designed by different techniques. 
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Figure 3.12, Translation Response of LSAD and off-nominal SSAD to an Initial Velocity 
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Chapter 4 


Design and Analysis Approach 


The 24 state linearized model of the SRMS developed in Chapter 3 is used in subsequent 
Chapters for regulator design using three techniques: LQR, H*i , and H 0 0 optimization. Before 
any of these techniques are pursued, it is appropriate to consider the characteristics of the 
open-loop plant and their possible effects on the design process. This Chapter examines several 
aspects of the plant and anticipates how some can cause problems in both the design and 
analysis process 1 . Stability and performance criteria upon which each regulator design are 
evaluated is introduced as well. 

Section 4.1 examines the open-loop plant and suggests advantages of precompensation loop 
closure. Model reduction is applied to the open-loop plant to reduce it to 18 states, thereby 
reducing the order of any resulting compensator. The singular values of the reduced order 
system are compared to the full order system. The poles and transmission zeros of the reduced 
order plant are evaluated. Issues such as controllability and observability are also considered. 

Section 4.2 concentrates on analysis of the open-loop plant. The criteria considered are 
the same that are applied to each compensator design in order to provide a common measure 
for evaluation. The singular values of the transfer functions from disturbance to end effector 
position and attitude are inspected. The transfer function from disturbance to control effort is 
scrutinized as well. In face of the parametric uncertainty present in SSAD, robust stability is 
considered via the small gain theorem. As a less conservative method, the structured singular 
value, /i, is also used in analysis. Due to the nature of the uncertainty, fi - analysis is still 

'This may seem like foresight, in t.he thesis, though it is actually hindsight in the research. 
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conservative. The impact this will have on the analysis process is explored as well. 

4.1 Obtaining a Design Plant 

The first inclination when designing a new compensator for a system is to remove the 
old compensator and then proceed to the design process with the resulting open-loop plant. 
Such an open-loop plant was presented in Figure 3.5. The poles of this open-loop plant are 
plotted in Figure 4.1. With no feedback to the system, as expected, there are six poles residing 
at the origin. These rigid body modes, though not welcome, would be accepted except that 
computationally they do not appear exactly at the origin. Machine precision places them at 
some small e on either side of the jo/- axis. This e propagates through the design process causing 
computational problems later. 

One method of dispensing with this dilemma is to leave the joint angle feedback loop intact. 
The current Position Hold controller as shown in Figure 3.9 consists of only a gain matrix. 
Leaving the current Position Hold loop in place and designing around it can be thought of as 
precompensation loop closure. Since there are no dynamics associated with the Position Hold 
gains, designing around them will not increase the order of a resulting compensator. There are 
several advantages to this precompensation loop closure. 

Each joint operating in the current Position Hold Mode behaves as a spring with damping; 
the joint angle feedback of Position Hold will pull the origin poles decisively into the left 
half plane. This not only alleviates the computational problems discussed above, but it also 
produces a non-singular system A matrix. Invertability of the system would allow for the 
application of loop-shaping techniques in the LQG/LTR methodology such as those presented 
in References [13] and [12]. Furthermore, when designing an enhancement to a system already 
in use, it is conceptually more palatable to design the upgrade as stability or performance 
augmentation such that the improvement will work around the current system rather than 
replacing it. If necessary, the augmentation to the software can be dissabled leaving the orginal 
system intact. 

For the reasons outlined above, the current Position Hold system is treated for all intents 
and purposes as the open loop plant. Henceforth, in all discussion of analysis of the open-loop 
plant, it is the original Position Hold system that is actually being considered. The poles of 
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this system are shown in Figure 4.2. 

The six distant poles, which correspond to the dynamics of the motor rates, are considerably 
faster than the rest of the system. They are even faster than the fastest possible sample rate 
of 80 milliseconds in the actual system. Eliminating the fast poles will reduce the model to 
18 states and thereby reduce the order of a compensator. As long as the bandwidth of a 
compensator designed from the reduced order model (ROM) is kept relatively small, there will 
be no interference. To insure problems do not occur with performance or stability as a result 
of the model reduction, the ROM is used for controller design purposes only. The full order 
model (FOM) is always used in analysis. This will isolate stability robustness problems to the 
parameter uncertainties rather than confusing the issue with neglected dynamics. Section 4.1.1 
examines a method for eliminating the fast dynamics of the system, while Section 4.1.2 explores 
the characteristics of the ROM and compares it to the FOM. 


4.1.1 Model Reduction 

Denote the transfer function from joint rate commands to end effector position and attitude 
as 

G(s) := (A,5,C,D) 

This can be decomposed as 

G( s ) = [<?(*)], + [<?(*)]/ (4.1) 

where 

[<?(*)], := (A u ,B u Cu^) 

represents the slow modes of the system, and 

[G( s )]/ : = (^22, ^2, 

represents the fast modes to be cut from the system. Finding the matrices which define the 
decomposed system requires the use of the algorithm presented in Reference [14] in the manner 
outlined below: 

Using an ordered Schur decomposition, find a unitary matrix, V such that 


A = VAV r 


A^^ A-\ 2 

0 A22 


(4.2) 
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Consider A, to be the i ih eigenvalue of some matrix, A. Using the ordered Schur form, one 
can find An and A 22 such that 

|a, (^ 11 ) I < | (4.3) 

For SSAD, the system is divided into its 6 fast modes and 18 slow modes. Next, solving the 
following equation for X 



4.1.2 Characteristics of the ROM 

Figure 4.3 plots the singular values of the ROM and the FOM. No information is lost until 
well beyond the bandwidth of the open-loop system. In the lower frequency range where the 
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ROM: Solid FOM: Dashed 



Figure 4.3. Singular Values from Joint Rate Commands to End Effector State for the ROM 
and FOM 

system has significant gain, the two models are identical. Whatever DC effect the fast poles 
of the FOM had on the system is preserved in the reduction process. Also evident from the 
singular value plots is the poor scaling of the system. This can be seen from the distance 
between individual singular values. This ill-scaling is to be expected since the end effector 
state vector is comprised of translations measured in inches, and rotations measured in radians. 
Certainly scaling must be applied before design so that a regulator will not consider a one inch 
error and a one radian error with equal gravity. 

Figure 4.4 displays the poles of the ROM. The 18 poles of the ROM reside in the same 
locations as the near poles of the FOM as shown in Figure 4.2; as desired, the fast dynamics 
have been eliminated. 

Figure 4.5 bears silent witness to the fact that the model is non-minimum phase. Note, 
however, that all of the transmission zeros are exceedingly fast relative to the rest of the 
system. 

We can study the controllability and observability properties of the system since SSAD in 
its nominal configuration is a linear time-invariant system. Consider the following gramians: 
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A necessary and sufficient condition for controllability is that the controllability gramian 

L r (t) = J* {e Ai BB r e AT, }dt (4.9) 

must be full rank. Dually, a necessary and sufficient condition for observability is that the 
observability gramian 

L„{i) = J* [e ATt C r Ce At }di (4.10) 

must be full rank. If t is allowed to go to infinity and A is stable, then L r and can be found 
via the following Lyapunov equations: 

AL C + L r A r + BB 1 = 0 (4.11) 

A r L„ + L„A + C r C = 0 (4.12) 

Evaluating these gramians for the ROM ascertains the system to be fully controllable and 
observable. Recall that the end effector position and attitude are the measurements used for 
the new regulator designs. The controls are the six joint rate commands. Table 4.1 presents the 
singular values for the controllability and observability gramians. Though the system is fully 
controllable and observable to machine precision, this can be slightly misleading. Looking at 
Table 4.1 some modes are clearly more controllable and/or more observable then others. This 
implies that certain modes may require exorbitant amounts of control effort if they are to meet 
certain performance requirements. Fortunately, however, all of the modes are stable. 

All of these issues are important to keep in mind during the design process. There will be 
limits to how far certain poorly controllable poles can be placed because of restrictions on the 
control effort. Keep in mind that the reduced order model presented in this Section will only 
be used for design purposes, the full order model will be used for analysis. 

4.2 Analysis Techniques and the Open-Loop Plant 

Before proceeding to design a new regulator to augment the performance of the current 
Position Hold controller, let us consider Position Hold performance as it stands now. This 
Section will analyze the current system following a threefold path. First, Section 4.2.1 considers 
the transfer function from disturbance to end effector position and attitude respectively to 
determine system performance. Second, Section 4.2.2 examines the transfer function from 
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Table 4.1. Gramian Singular Values 


Controllability Gramian 
Singular Values 

Observability Gramian 
Singular Values 

2.0767e+07 

2.1420e+05 

7.0207e+05 

1.4658e-01 

2.0295e+07 

1.3638e+05 

3.6847e+05 

7.7801e-02 

7.1140e+06 

8.5671e+04 

1.0944e+04 

3.8524e-02 

1.8852e+06 

6.6693e+04 

4.3336e+00 

3.4467e-02 

1.8332e+06 

4.7181e+04 

1.7348e+00 

1.6510e-02 

1.4352e+06 

2.6289e+04 

1.2482e+00 

4.7992e-03 

9.2866e+05 

2.7779e+00 

4.8469e-01 

9.0874e-04 

3.5116e+05 

1.1020e+00 

2.6327e-01 

2.4196e-04 

2.6732e+05 

8.8740e-01 

1.7180e-01 

1.5619e-05 
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disturbance to control effort in order to establish how much control effort is required. Finally, 
Section 4.2.3 looks at the stability robustness of the system. This method of system analysis is 
the same procedure applied to every new controller designed in subsequent Chapters. 

4.2.1 Performance 

In evaluating the effect of a disturbance on the end effector state, only the rotational Shuttle 
commands are considered since the effect of fictitious translational jets were added only so that 
mathematically every state would have process noise. In fact, for Shuttle RMS operations, 
translational commands are prohibited. As pointed out in Section 3.2.2, the disturbance is 
normalized to lie between -1 and 1; let us consider the singular values from disturbance to the 
end effector in two sets. The first set will include the end effector position in inches and the 
second set will consist of the end effector attitude in radians. In this manner the singular values 
will retain their units making the evaluation of performance an easier task. Figure 4.6 displays 
these two sets of singular values for the current Position Hold regulator. 

From Figure 4.6 we can see that the end effector excursion from equilibrium can be as great 
as 5 inches in position and as much as 0.03 radians in attitude for certain Orbiter rotation 
command directions. The goal in designing a new regulator is to bring these maximum excur- 
sions down as far as possible without exceeding the limitations on the control effort described 
in the next Section. Ideally we aim for less than 1 inch excursion in position and less than 0 01 
radians of rotation excursion. 

4.2.2 Control Effort 


There are already in place certain joint rate command limits, ie., limitations on the control 
effort. These limits are not based on any hardware constraints, rather they are safety settings 
to insure a payload at the end effector isn’t commanded too rapidly bringing the system into 
precarious areas of operation. These settings are a function of the payload inertia. The limits 
are calculated to maintain a prescribed stopping distance when operating the RMS. If new 
controllers can exhibit improved capability to control the end effector, there may be justification 
for increasing the joint rate limits. 

For design purposes, the joint rate limits will be treated as hard constraints that must 
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Frequency (radians/second) 



Figure 4.6. Performance Singular Values of the Current Position Hold System 
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Table 4.2. Joint Rate Limits 


Joint 

Joint Rate Limit (rad/sec) 

Shoulder Yaw 

0.0052 

Shoulder Pitch 

0.0052 

Elbow Pitch 

0.0072 

Wrist Pitch 

0.0107 

Wrist Yaw 

0.0107 

Wrist Roll 

0.0107 


be met. In this manner, the lower bound on performance will be established. Increasing the 
joint rate limits will allow a new controller to use more control and thus allow performance to 
improve. Table 4.2 presents the current joint rate limits. 

To determine the control effort used by the system for a disturbance caused by Shuttle 
attitude thruster firings, consider the singular values of the transfer function from the attitude 
firings to each individual joint rate command. There are six joints and six different joint rate 
limits. Let us scale each joint rate command by its corresponding joint rate limit. In this 
fashion, as long as the singular values of these six transfer functions remain below unity, the 
joint rate limits have not been exceeded. As Figure 4.7 shows, the current system is safely 
below the joint rate limits. 

Keep in mind that all performance and control effort analysis performed so far has been 
on the nominal plant. Recall there are two parameters, Km A and K( that include additive 
uncertainty. The next issue to consider is robust stability in face of this uncertainty. 

4.2.3 Stability Robustness 

Recall the standard form for a compensated plant with uncertainty shown in Figure 3.8. The 
closed-loop version of this is shown in Figure 4.8. G(s) is the closed-loop transfer function which 
combines the dynamics of the plant and the compensator and A is the modeling uncertainty. 

The closed-loop transfer function, G(s), can be divided into a set of four transfer functions 
aw follows, 
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Frequency (radians/second) 

Figure 4.7. Control Effort Singular Values of the Current Position Hold System 
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Figure 4.8. Closed-Loop Transfer Function 
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Figure 4.9. Closed-Loop System for Robust Stability Analysis 


f Gll( s ) <?12( s ) 

e G 21 W @ 22 ( 3 ) 

In this representation, the nominal closed-loop system is given by G^U)- Let us redraw 
Figure 4.8 as shown in Figure 4.9. In this representation, looking at the A and Gn(s) transfer 
functions, the small gain theorem can be invoked. 

By invoking the small gain theorem, a sufficient condition for closed-loop stability is 

||G,i(«)A|| 00 <l (4.14) 

The Hoc norm is defined in the frequency domain for a stable transfer function matrix as 


(4.13) 


IIWIloo 


supff-(GO'w)] 


Ml 

IMMo IMk 


(4.15) 


Therefore, the norm is the induced 2-norm. 

In Section 3.2.2, A was defined to be normalized to less than one. Therefore, Equation 4.14 


can be reduced to 

||C7ii(«)||oo<l 


(4.16) 


This small gain test is valid for any complex A with norm less than one. However, for indepen- 
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dent modeling uncertainties, A is known to have a block diagonal structure 


A = 


A, 0 ... 0 

0 a 2 

: 0 

o ... 0 A m 


For the SSAD plant, there are two block diagonal elements for K\ 1A and K( } . Since the small 
gain test must hold for all normalized A, not just the more restricted set of normalized block 
diagonal A matrices, A|>, it is a conservative test. Because it is known the perturbations of 
SSAD are real parameter variations, testing for complex perturbations makes the small gain 
test even more conservative. 

In an effort to supply a less conservative robust stability test, Doyle [15] developed the 
structured singular value, fi.fi is defined as 


fi[G{ju)\ = i ° if det ^ “ G 0 W ) A 0' W )] # 0 for a11 A € A,, 

{min^gAn d’fAQw)]} 1 det[7 — G(jur)A(ju;)] = 0 otherwise 

(4.17) 

Consider |i[(?n(jw)]. From the definition of fi, if fi\G\ i (j <*> )] > 1 at some frequency ui then 
there exists a block diagonal A(ja>) such that 

<7[Ai(ju>)] < 1 and det[7 - G\ i ( ju) A(jw)] = 0 

Conversely, if ^i[G^^{ju))\ < 1 for all w, then the smallest block diagonal A matrix that makes 
the system singular has the gain 

<7 [A, (./'«)] > 1 

which violates the norm of the modeling error. Therefore, a necessary and sufficient condition 
for robust stability when A £ Ad is 


MGnlM] < 1 Vw (4.18) 

Unfortunately, in the actual application of /i-analysis we are reduced to conditions which 
are sufficient but not necessary for two reasons. First of all, fi is not a norm and it cannot be 
computed easily. Software is available that can calculate upper bounds for fi. However, using 
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an upper bound adds some conservatism to the analysis. The second source of conservatism 
again comes from the fact that SSAD has real parametric uncertainty. The set Ad still allows 
for complex perturbations as in the small gain test. This is a particularly bothersome problem 
for the SSAD plant which contains lightly damped poles with large error. With lightly damped 
poles, even small complex perturbations can cause the condition for robust stability to be 
violated, ie., f*[Gi'i(ju)] > 1. Consider the following simple example. 

Example 4.1: Spring-Mass-Damper System with Uncertain Spring Constant 

A spring-mass-damper system with a unit mass can be represented by the differential 
equation 

x + cx + kx = / (4-19) 

where c is the viscous damping coefficient given as 0.1, k is the spring constant, and / is a 
disturbance force. The spring constant is uncertain, but modeled to fall in the range 

0.5 < Jfc < 2.5 


If k is selected in the center of its range for the nominal system, a state-space representation is 
given by 


*2 


x 


0 1 
-1.5 -0.1 
Ax + Bf 





0 



+ 





1 


[/] 


(4.20) 


The nominal system has stable roots at A = — 0.05 ± 1.224*. In fact, throughout the range of k 
the system remains stable with roots at A = —0.05 ± 0.705i for the high end of the range and 
roots at A = — 0.05± 1.5803s for the low end of the range. The system is robustly stable in face of 
the parametric uncertainty. Nevertheless, since fi - analysis will consider complex perturbations, 
this system will fail the fi test for stability. 

Consider, as an example, the perturbation matrix 


A A = 


0 0 

i 0 
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Frequency (radians/second) 

Figure 4.10. Robust Stability Analysis of the Current Position Hold System 

whose norm is one. Though this perturbation cannot occur in the real system, analysis 
will test for such a A. This perturbation places the roots at At = 0.3394 + 1.284i and A 2 = 
—0.4394 — 1.284* failing robust stability. 

Though /i- analysis does not supply a necessary condition for robust stability, it does reduce 
conservatism over the small gain test. To illustrate this, Figure 4.10 displays the results of 
the small gain test and /i-analysis performed on the current Position Hold controller. Though 
^-analysis is less conservative than the small gain test, the plant still fails robust stability. 

To determine if the instability displayed in Figure 4.10 is the result of a fictitious complex 
perturbation, or a configuration the system can actually realize, it is possible to evaluate the pole 
locations of the system for different parameter values. To analyze every possible combination of 
the twelve parameters would not be feasible^ however, it is reasonable to check the endpoints. 

Consider when the end effector is very close to its equilibrium position and attitude. At 
these very small amplitudes, the current limiter has not saturated, hence all six of the K m a are 
at their maximum values. Furthermore, at very small amplitudes the joint angles and motor 
shaft angles are almost aligned, thus the six Kq are at their minimum values. Figure 4.11 
displays the pole locations of the system with the parameters Km A and K a at these endpoints 
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Real 

Figure 4.11. Open-Loop System Near Pole Locations With Ka = 0.01 and K m a = 
of their ranges. 

As Figure 4.11 shows, the instability indicated by /i-analysis is not fictitious. The system 
is not robustly stable for the full range of values the parameters can inhabit. If all six I<a are 
increased in unison at 0.01 increments, the lowest values that achieve stability are 0.05. This 
value brings the system poles just to the left of the jut- axis. This configuration is shown in 
Figure 4.12. Interpreting these observations indicates the SSAD plant is a system that is stable 
at moderate to high torque values when Ka is higher than 0-05 and unstable at very low torque 
values when Ka is below 0.05. Such a system would stably damp and then limit-cycle close to 
the equilibrium position; such behavior is observed in the actual system. 

To totally eliminate this limit cycle behavior with a regulator may require a large sacrifice 
of performance. In any case, it would be misleading to think that eliminating such behavior 
with a regulator on our model would do so for the actual system. Recall that in Chapter 3 fric- 
tional effects were neglected in SSAD. Such effects would certainly contribute to the limit cycle 
behavior. This is something to keep in mind when designing new compensators. Instabilities 
caused by very low Ka may not be a problem since they indicate slow limit-cycling of small 
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Real 

Figure 4.12. Open- Loop System Near Pole Locations With Ka = 0.05 and K\ia = K m 
amplitude near the equilibrium values which is unavoidable. 

4.3 Summary 

In this Chapter, the 24 state SSAD model has been reduced to 18 states for design purposes 
only. The 24 state model will be used for analysis purposes. Three criteria that will be 
considered for each design are the performance singular values, the control effort singular values, 
and robust stability of the plant. The analysis process was carried out on the current Position 
Hold controller as both an example of the process and the baseline for comparison. In analyzing 
performance, two sets of singular values were examined: disturbance to end effector position as 
well as end effector attitude. Control effort was determined by looking at the singular values of 
the transfer function from disturbance to control effort. Finally, robust stability was determined 
via the small gain test, and the less conservative structured singular value, /i. /i-analysis may 
still be too conservative a test since the SSAD plant is restricted to real parametric variations. 
To determine this, we resorted to eigenvalue evaluation at certain parameter values. 
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Chapter 5 

LQR Design 


Linear Quadratic Regulator (LQR) design is very limited in its applications. This technique 
not only requires knowledge of every state, but also, this knowledge must be uncorrupted by any 
sensor noise. Since measurements of every state are not available, a linear quadratic regulator 
cannot be implemented on the actual SRMS without first designing an estimator to approximate 
the states. Nevertheless, it is useful to design a LQR because it gives an idea of the best we 
can hope to achieve in terms of minimizing the ‘2-norm’ of the error, ||e|| 2 . It is a simple 
first step that helps to motivate the model-based compensator (MBC) designs. In MBC design 
techniques, the states of the system are estimated from the measurements. These estimates are 
then used by a compensator. MBC design techniques are the topics of subsequent Chapters. 

The LQR technique is derived from optimal control theory. Kwakernaak and Sivan [16] 
provide the full details of this theory. Section 5.1 provides the main results of the LQR technique 
as it applies to the SRMS problem. 

Section 5.2 analyzes the LQR for the SRMS using the techniques presented in Chapter 4. 
The performance and control effort singular values are examined and compared to the open- 
loop system. A time domain simulation of a typical jet firing pattern is also presented. Finally, 
stability robustness is examined in face of the parametric uncertainty. 

5.1 LQR Theory 

LQR theory provides no direct method to design for stability robustness or robust perfor- 
mance, thus for design purposes Figure 5.1 eliminates the A block of Figure 3 8. The effects of 
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Figure 5 . 1 . Plant With Compensator 

the A block are considered after the design process is completed. 

The general state-space representation of Figure 5.1 is 

Xp Ap By i?2 Xp 

e = Cy Dyy Dy 2 d (5.1) 

y C2 A21 D22 u 

In the LQR design methodology, uncorrupted full state feedback is called for. Thus the 
disturbance vector, d, does not include sensor noise, only the process noise. Furthermore, C 2 
is the identity matrix. We will design the LQR compensator from the reduced order model 
presented in the last Chapter. For analysis, we will apply the reduced order compensator 
design on the full order plant. The result of an LQR design is well known to be a constant gain 
matrix, thus an LQR does not increase the order of the closed-loop system. 

The error vector, e, includes the states and the controls. 


The states are weighted by a symmetric and positive semidefinite matrix, Q, while the controls 
are weighted by a symmetric and positive definite matrix, R . Additionally, the modelling 
performed in Chapter 3 produced no direct feedthrough term, thus D 22 = 0. Hence for the 
LQR formulation of the SRMS, Equation 5.1 can be recast as 
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e 

y 

The LQR problem is to devise a feedback control law which minimizes the quadratic cost 
functional 

J = J [x p r Qx p + u' Ru^dt (5.3) 

The optimal feedback control law is 

u = -Gx p 

G = R-'B/K (5.4) 

where K is the symmetric, positive semidefinite solution of the algebraic Riccati equation 

A p r K + I<A P - KB p R-'B p r K + Q = 0 (5.5) 

In general, there are many solutions to Equation 5.5, but only one of them is positive 
semidefinite. The gain matrix K will exist and be positive definite provided that the system 
(A p , B p ,Q?) is stabilizable and detectable. In other words, any uncontrollable or unobservable 
modes are asymptotically stable. 

The closed-loop system dynamics become 

x p — A p x p 4“ B p u 4* B-\d 

u - -Gx p 

x P = [A p -B p G]x p + B^d (5.6) 

Furthermore, the closed- loop system is guaranteed to be stable, ie., 

ReAj [A p — B p G] < 0 Vi (5.7) 

The exact pole locations will depend not only upon the open-loop system (through A p and S p ), 
but also upon how the weighting matrices Q and R are selected. Thus Q and R are the design 
parameters that are tuned until a LQR design meets desired disturbance rejection and control 
effort requirements. 


By B p 


d 

u 


(5.2) 
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5.2 LQR Results 


For the SRMS problem, the only states of concern are the ones corresponding to the end 
effector position and attitude. To express this desire, Q is selected as a matrix of zeros except for 
the values along the diagonal elements corresponding to the end effector position and attitude 
states. 

Recall that the end effector position is measured in inches and the attitude is measured in 
radians. To scale the outputs, consider the singular values of the open-loop system as shown in 
Figure 4.6. The attitude maximum singular value is more than an order of magnitude below the 
position maximum singular value. The weights in Q are chosen to reflect this difference. The 
elements of Q corresponding to attitude are placed more than an order of magnitude above the 
position weights. In this manner a one inch error is not considered as serious as a one radian 
error. The actual values of the weights do not matter, only the ratio between them. Such a 
selection of Q also insures that the system (A p , B p ,Q?) is stabilizable and detectable. 

In selecting R y certainly every control must be weighted to insure the joint rate limits are 
not violated. As shown in Table 4.2, we already know the limit of control demand considered 
acceptable. As a starting point, choose the inverse of the joint rate limits as the initial weight 
and then iterate over the design process, tuning R each time until the singular values from 
disturbance to control effort fall just below the joint rate limits for each control channel. This 
procedure will result in the best performance possible with the LQR design technique given the 
limits on the control effort. 

Section 5.2.1 examines the control effort and performance singular values in the same manner 
as outlined last Chapter. A time domain simulation is included as well to help interpret the 
singular values. Section 5.2.2 examines the stability robustness of the LQR design. 

5.2.1 LQR Control Effort and Performance 

Figure 5.2 shows the singular values from disturbance to control effort after the iteration 
process. Each control weight is fine tuned until the scaled singular value falls just below 
one. Below 0.1 radians/second, the control effort has not changed much from the open-loop 
system. For disturbances occuring at 1 radian/second, the LQR system uses almost two orders 
of magnitude more control than the open-loop system. Nevertheless, the LQR system does not 
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Figure 5.2. Control Effort Singular Values of the LQR System 
exceed the joint rate limits. 

Figure 5.3 presents the singular values from disturbance to end effector position and attitude. 
Included on the same plots is the corresponding information for the open-loop system. For the 
LQR design we were able to meet the performance goal of less than 1 inch excursion in position. 
This is expressed by the maximum singular value falling below one for all frequencies. Thus 
for any input disturbance, its effect on the end effector position will be multiplied by a gain at 
least less than one. Since the input disturbance is already normalized to fall between -1 and 
1, the maximum excursion of the end effector position is less than one inch. Similarly, since 
the maximum singular value for the disturbance to attitude transfer function falls below 10 
we have guaranteed less than 0.01 radians of rotation excursion. Both goals are met without 
exceeding the joint rate limits. 

To better visualize what is occuring on the singular value plots, a time domain simulation 
is presented in Figures 5.4 and 5.5. These figures include both the response of the open-loop 
system and the LQR system. In this simulation a roll command is given for a full 10 seconds 
immediately followed by a 10 second pitch command and finally a 10 second yaw command. 

From Figure 5.4 we can see that the X direction has considerably poorer disturbance re- 
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Magnitude (PH: dashed LQR: solid) Magnitude (PH: dashed LQR: solid) 



Figure 5.3. Performance Singular Values of the LQR System 
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Figure 5.5. Rotational Response of PH System and LQR System to Attitude Thruster Com- 
mands 
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Figure 5.6. Robust Stability Analysis of the LQR System 

jection than the Y and Z directions. Nevertheless, significant improvement over the uncom- 
pensated system is also apparent. Similarly, roll disturbance rejection is considerably better 
than the other axes as shown in Figure 5.5. These plots indicate that while in HST release 
configuration, the SRMS can reject disturbances much easier in some directions over others. 

5.2.2 LQR Stability Robustness 

Applying both the small gain test and ^-analysis to the closed-loop LQR system results in 
Figure 5.6. Unfortunately the system fails both tests. Recall as outlined last Chapter that both 
tests are conservative for SSAD since the uncertainties are real parameter variations. Therefore 
it is possible that the system is in fact robustly stable in face of the uncertainty though it 
cannot be shown. 

Last Chapter, to confirm the results of /i- analysis, we evaluated the eigenvalues of the 
open-loop system for different parameter values. This technique conclusively showed that the 
open-loop system was in fact not robustly stable. As before, we can evaluate the eigenvalues of 
some combinations of the parameters to further our information on robust stability. 
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Real 


Figure 5.7. LQR Closed- Loop System Near Pole Locations for the Nominal and Off-Nominal 
System 

Consider the twelve parameters in two sets, the Km A and the Ka. After checking the 
four combinations of these two sets at the endpoints of their ranges, as we discovered with the 
open-loop plant, the worst possible combination is found to be the K ma at the top of their 
range and the Ka at the bottom of their range. The worst combination is defined to be the 
system in which the closed-loop poles reside closest to the ju>-axis, or furthest into the right 
half plane. Figure 5.7 presents the LQR closed-loop near poles of the nominal plant and the 
worst off-nominal plant. 

For these combinations, the system remains stable. Let us qualify this immediately; this 
does not prove robust stability. Though experimentation seems to indicate that the combination 
of the Km a at the top of their range and the Ka at the bottom of their range is the system’s 
worst, we cannot prove this conclusively. This is a multi-variable system with twelve varying 
parameters. There can be some other combination of parameters not considered that will result 
in an unstable closed-loop system. 

To further dampen our spirits, keep in mind that several nonlinearities have been totally 
eliminated in the development of SSAD. Thus even if robust stability is guaranteed on the 
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model, it is not guaranteed on the actual system. We will not bypass the extensive pre-flight 
simulation requirement of a Shuttle mission, 

5.3 Summary 

Using the LQR technique, we have designed a compensator that is capable of meeting the 
performance objectives within the allowable control effort. The resulting regulator fails robust 
stability analysis via both the small gain test and /i-analysis. However, actual eigenvalue 
evaluation with four combinations of the parameters at their endpoints seems to indicate that 
the system may be robustly stable. Unfortunately, this cannot be proven unequivocally. 

There are several drawbacks to the LQR technique. This technique not only requires knowl- 
edge of every state, but also, this knowledge must be uncorrupted by sensor noise. These 
requirements render an LQR design impractical for implementation of the actual SRMS. An 
estimator would be required to determine the states for the LQR. This would add noise to the 
state feedback vector which the LQR does not account for. If an estimator must be designed, 
it is better to use an optimal design method such as the LQG technique. 
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Chapter 6 


H 2 Design 


Last Chapter we saw that with full state feedback, performance goals were met without 
exceeding the control effort limitations. However, in the actual system we are limited to an 
observation vector, y, which is some linear combination of the states corrupted by noise. The 
solution to this more involved problem is given by the Linear Quadratic Gaussian (LQG) theory 
studied extensively in the 1960’s and 1970’s. The LQG solution is realized via the separation 
principle. The problem is reduced to two sub-problems, the solutions to which are well known. 

The first sub-problem is to obtain an estimate, x, of the state, x. This estimate is to be 
optimal in the sense that E j(x — x ) f (x — x)| is minimized. The solution to this problem is 
given by Kalman filter theory. Kalman filters have the structure of a state observer. They 
receive as inputs the plant input, u, and plant output, y, and produce as an output the state 
estimate vector, x. 

The second sub-problem involves using the state estimate from the Kalman filter as if it 
were an exact measurement of the states. As in last chapter, we solve the resulting determin- 
istic linear quadratic control problem. The state estimates are fed through the resulting state 
feedback matrix, thus producing the control for the complete system. The structure of the total 
LQG compensator is a Kalman filter in series with a state feedback matrix [12]. 

A consistent, but more suave term for the LQG problem is the H<i problem. This nomen- 
clature arises because solving the LQG problem is analogous to the minimization of the H 2 
norm of the transfer function from exogenous inputs to the error vector. Section 6.1 develops 
the H*i theory in this fashion, following the work of Reference [17]. This approach is used 
over the traditional presentation of the LQG problem because it utilizes the two block format 
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Figure 6.1. Plant With Compensator 

we used in previous Chapters. Aside from visualizing the problem in a simpler manner, this 
approach helps motivate the 0 problem in the culminating Chapter. The H a q 1 problem can 
be developed parallel to this approach. 

Section 6.2 analyses the results of the H 2 design using the same criteria as before. The 
singular values from disturbance to control effort and disturbance to the end effector states are 
examined and compared to the open-loop system. A time domain simulation of the same jet 
firing pattern used before is also presented. Finally, stability robustness of the closed-loop H 2 
system is examined including the parametric uncertainty. 

6.1 #2 Theory 

Recall the general block diagram of the plant, but with the A block removed, as shown 
in Figure 6.1. This is the plant with compensator where u and y are the control inputs and 
observation vector respectively, d represents all exogenous inputs to the system, d includes 
process and sensor noise for the H 2 problem, e is the vector of signals whose mean squared 
value we wish to minimize. As before we chose it to contain the control effort and end effector 
states. 

y 

e = 

u 

1 JIi and JJcxj are both TTardy spaces 
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P(s) is derived from the nominal plant model. Suppose that P(s) is partitioned as follows: 


e 


1 ( S )pi X mi 

P 12(s) piXm , 


d 

y 


^21 x mi 

7 , 22(^) p7 xma 


U 


We can eliminate u and y using u = Ky, to obtain 

e = [P u +PnK(I-PnK)-'Pn\d (6.2) 

It is convenient to adopt the shorthand for the linear fractional transformation of P and K 

r*(P,K) = Pu + P\*K(I - PnK)-' Pn (6.3) 


so that we can write 

e = T t {P,K)d (6.4) 

A simple way to state the H 2 problem is to minimize || J r f(P, K ) || 2 for stabilizing K over all 
Tt\P,K) £ The H 2 norm is defined in the frequency domain for a stable transfer function 
matrix, G(s) as 


im = J_ trace [ G (-i w ) / g 0 w )] <*•») 

Recall that the general state-space representation of Figure 6.1 is 


(6.5) 


i P 


A p 


b 2 

e 

= 

c, 

D11 

D 12 

y 


c 2 

D21 

D 22 


(6.6) 


In order to insure stabilizing controllers exist, the uncompensated plant, (Ap, i? 2 , C 2 ), as 
well as (Ap, } C^) must be stabilizable and detectable. The SSAD plant meets this condition 
and we must choose the error vector and its weights so that it too meets this condition. In order 
to ensure the realizability of resulting controllers we must also choose the weights on the error 
vector so that rank(Di 2 ) = ^2 and rank(D 2 i) = p? where m 2 and p 2 are given in Equation 6.1. 

Define the following transformation variables 


Zi = (Dn r Dn) * 
Zi — ^Z?21 A21 ' ^ 


(6.7) 



and apply them to the plant as follows 


5 2 

= B 2 z\ 

c 2 

— Z'lC'l 

b\2 

— D\ 2 z\ 

b^x 

= *2^21 


(6.8) 


Let us assume these transformations have already been applied, thus D 12 and D 21 have 
been normalized so that 


D u = [0 if and D 2 ^ = [0 /] 


(6.9) 


The point of these transformations is that we can reduce the complexity of the controller 
formulas as long as we can write 


D] 2 1 [Ci £>12] — [0 I] 


( 6 . 10 ) 


and 


D 2 \ 


D21 r = 


( 6 . 11 ) 


Equation 6.10 means that the penalty on e = Ci x^+D^u includes a normalized, nonsingular 
penalty on u. In other words, the error vector includes the control inputs with an identity 
weighting matrix and there is no cross weighting between the states and the control. 

Equation 6.11 is dual to Equation 6. 10. This equation flows from assuming that the dis- 
turbance vector, d ) includes both the process noise on the states and the sensor noise on the 
measurements. These noises are separate and the sensor noise weighting matrix is normalized 
and nonsingular. 

The transformations in Equation 6.7 allow us to choose non-normalized weights for D 12 and 
D 21 and then scale the plant so that Equations 6. 10 and 6.11 still hold. 
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Two additional assumptions made on the plant are that Du = 0 and D 22 = 0. We require 
the former for the H 2 cost to be finite; the latter simplifies the formulas. For SSAD D 22 = 0 
and we chose the error vector to not include the disturbance hence Dn =0. 

Thus the form of the system becomes 


x r 


A p 

By 

B 2 


x v 


e 

- 

Ci 

0 

Dyi 


d 

(6.12) 

y 


C 2 

D 2 1 

0 


u 



with 

D , 2 = [0 if and D 2 i = [0 I\ 


Before proceeding, let us define some notation. Let A, Q, and R be real n x n matrices. Q 
and R are both symmetric. Define the Hamiltonian matrix, H , as 


H 


A R 
Q -A 1 ' 


(6.13) 


The definitions of the symbols dom(Ric) and Ric (H) are that if H £ dom(Ric) and X = Ric (H) 
then 

(i) X is symmetric. 

(ii) X satisfies the algebraic Riccati equation 

A V X + XA + XRX -Q = 0 

(iii) A + RX is stable. 

In order to find an admissible controller K which minimizes \\!Ff (P,K ) || 2 , we must solve 
the following two Riccati equations given by their Hamiltonian matrices 


*2 


^2 


Ric 


A p —BiBz* 

-Ci 7 Ci -A p r 


A p r -C 2 7 'C 2 
-ByBf -A p 


(6.14) 

(6.15) 
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The two solutions, X 2 and Y 2) are positive semidefinite. Using these solutions, define 


F 2 = -B 2 r X 2 (6.16) 

L 2 = —Y 2 C 2 * (6-17) 


Notice that if the control weighting matrix is normalized to identity as we have done, F 2 is 
the optimal state feedback gain as solved for last in the Chapter with full state feedback. L 2 
represents the solution to the Kalman filter gain matrix. Now we may write the unique optimal 
controller as 


K(s) : = 


A p + B 2 F 2 + L 2 C 2 

-l 2 

F 2 

0 


(6.18) 


Considering that our plant is square, (six inputs and six outputs), we have an invertible 
system A matrix, and that the non-minimum phase transmission zeros are fast relative to the 
rest of the system, it is tempting to apply a loop-shaping technique. The Linear Quadratic 
Gaussian/Loop Transfer Recovery (LQG/LTR) method presented in References [12] and [13] is 
a loop-shaping technique that allows us to shape the singular values at the input or output of 
the plant. 

Unfortunately, in the LQG/LTR methodology, the ways in which the process noise and 
sensor noise enter the plant are treated as tuning parameters rather than as representations 
of the real plant behavior. In terms of Equation 6.6, B 1 and D 2 \ become design parameters 
which are adjusted along with the weights on the states and the controls to shape the singular 
values. For SSAD the process noise, Shuttle jet firings, does enter the system in a prescribed 
manner. LQG/LTR can still be applied on SSAD, only it is not possible to directly shape the 
singular values of interest. After initial attempts at applying LQG/LTR it became apparent 
that conventional H 2 allows a more direct approach the problem since B 1 and D 2 \ are physical 
parameters. 
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6.2 H<i Results 


In the H 2 problem, the design parameters are the choice of weights on the contents of the 
error vector, y and u. In SSAD, since y corresponds to the end effector states, we can choose the 
Ci weights as we selected Q when designing a LQR. We multiply each channel corresponding 
to an attitude by an order of magnitude to account for the difference between position and 
attitude singular values in the open-loop system. In this manner a one inch error is not treated 
as serious as a one radian error by the H 2 controller. This choice is only an initial guess and 
can be modified to bring the closed-loop system within performance objectives. 

The weights on the control effort, Di 2 , are initially placed at the inverse of the joint rate 
limits as in the LQR design. Again, this design parameter is tuned until a satisfactory design 
is achieved. One thing to note, the final weights of the design do not have to correspond 
to the final weights of the LQR design. Recall the differences between the two design plants; 
LQR assumes full state feedback uncorrupted by noise while the H 2 compensator has only the 
observation vector which is also corrupted by sensor noise. 

Section 6.2.1 examines the control effort and performance singular values of the // 2 design. 
A time domain simulation driven by the same jet firing pattern as used on the LQR is included 
as well to help interpret the singular values. Section 6.2.2 examines the stability robustness of 
the H 2 design. 

6.2.1 ff 2 Control Effort and Performance 

Figure 6.2 shows the singular values from disturbance to each individual control effort 
channel after tuning the weights on the error vector. As before, each control weight is fine 
tuned until the scaled singular value falls just below one. Notice that two of the control effort 
singular values actually lie below one. These are the singular values corresponding to elbow 
pitch and wrist pitch joints. 

During the design process, if a particular control effort singular value fell below one, the 
corresponding weight was reduced allowing the compensator to use more control in that channel. 
Conversely, if a control effort singular value fell above one, the corresponding penalty was 
increased. For the elbow pitch and wrist pitch joint rates, the penalties were successively reduced 
up until the weighting matrix, D 12 started to numerically fail the rank tests. Nevertheless, the 
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Figure 6.2. Control Effort Singular Values of the System 

compensator chose not to use more control in these channels. 

There are two explanations for this result. First, the elbow pitch and wrist pitch joint rates 
may have limited ability to affect the performance objectives. Consider the simple example of a 
body with horizontal and vertical velocity as its controls. If we choose as performance objectives 
vertical position only, a controller would have no reason to use the horizontal control channel. 
This seems unlikely for SSAD since clearly from Figure 6.2 it is using some control in these 
channels. If the penalty is made less on these channels, it should use more control even if the 
benefit is slight. A more likely explanation is that use of the elbow pitch and wrist pitch joint 
rates is penalized through some coupling in the system. Use of elbow pitch may, for instance, 
reduce the error in the X direction but cause large excursions in Y which would require use 
of shoulder pitch to compensate. Thus the weight on shoulder pitch indirectly penalizes elbow 
pitch as well. 

Figure 6.3 presents the singular values from disturbance to end effector position and attitude. 
Included on the same plots is the corresponding information for the open-loop system. For the 
design we are barely able to meet the performance goal of less than 1 inch excursion in 
position. We are able to meet the attitude goal of less than 0.01 radians of rotation excursion 
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Figure 6.3. Performance Singular Values of the Hi System 
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without problem. Both goals are met without exceeding the joint rate limits. 

As with the LQR design, to help visualize what is occuring on the singular value plots, a 
time domain simulation is presented in Figures 6.4 and 6.5. These Figures include both the 
response of the open-loop system and the Hi system. As before, a roll command is given for 
a full 10 seconds immediately followed by a 10 second pitch command and finally a 10 second 
yaw command. 

From Figure 6.4 we can see that as in the LQR design, the X direction has considerably 
poorer disturbance rejection than the Y and Z directions for the H 1 compensator. Similar 
improvement over the uncompensated system as in the LQR design is also apparent. For the 
attitude directions, roll disturbance rejection is considerably better than the other axes as shown 
in Figure 6.5. Overall, the resulting H 2 design performance is only slightly below the full state 
feedback LQR design. 

6.2.2 H 2 Stability Robustness 

Applying both the small gain test and /i- analysis to the closed-loop Hi system results in 
Figure 6.6. As before, the system fails both stability robustness tests. With the LQR plant, we 
evaluated the eigenvalues of the closed-loop system for four sets of the parameters and showed 
that the system was stable for these endpoints. Though this is not conclusive proof, it suggests 
that it is possible that the LQR closed-loop system is indeed robustly stable. 

Evaluating those same sets of parameters for the closed-loop Hi design does show conclu- 
sively that the system is robustly unstable. Figure 6.7 shows the near pole locations for the 
nominal closed-loop Hi system. The following four Figures show the closed-loop poles for the 
four sets of perturbed parameters. Figure 6.8 places both the K \i a and the K( ; at their mini- 
mum values. Figure 6.9 places both sets of parameters at their maximum values. Figure 6.10 
places the Km A at their minimum values and the K a at their maximum values. Finally, Fig- 
ure 6.11 considers the last permutation of the Km a at their maximum values and the Ka at 
their minimum values. 

Aside from the nominal plant, the closed-loop system is only stable for Figure 6 9. In 
Chapter 4 when considering the open-loop plant, we were able to isolate the stability problem 
to when the system is operating at very low amplitudes. In this region, K m A is at its maximum 
and Ka is near its minimum. For the Hi system we cannot draw the same conclusions. Stability 
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Figure 6.5. Rotational Response of PH System and H2 System to Attitude Thruster Com- 
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Figure 6.7. Closed- Loop Poles of the Nominal H 2 System 
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H2 Closed-Loop Near Poles for Nominal Plant (Kma at min. Kg at min) 
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Figure 6.8. Closed-Loop Poles of the Off-Nominal H % System: 1 


H2 Closed-Loop Near Poles for Nominal Plant (Kma at max, Kg at max) 
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Figure 6.9. Closed-Loop Poles of the Off-Nominal Hi System: 2 
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H2 Closed-Loop Near Poles for Nominal Plant (Kma at min, Kg at max) 
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Figure 6.10. Closed-Loop Poles of the Off-Nominal H 2 System: 3 
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Figure 6.11. Closed-Loop Poles of the Off-Nominal System: 4 
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is a problem for the closed-loop system in several configurations. If the stability problem was 
caused by neglected dynamics, a solution to robust stability would be to reduce the bandwidth 
of the system. This could be accomplished by either further penalizing the control effort or 
reducing the weights on the end effector states, ie., demand less performance. Unfortunately, 
with parametric uncertainty the low frequency system poles simply aren’t where the controller 
is expecting them. Reducing the bandwidth will not solve our problem. We will explore this 
issue in further detail when considering the H „ 0 design. 

6.3 Summary 

This Chapter removes the requirement for full state feedback uncorrupted by noise and 
presents the f/ 2 (or LQG) approach. This approach involves designing a Kalman filter to 
estimate all of the system states from the observation and control vector and then using these 
estimates as perfect state information fed into a LQR. 

On the nominal plant, the H 2 approach is able to recover performance comparable to the 
LQR design. Both performance objectives are met without exceeding the limits on the control 
effort. Unfortunately, the closed-loop H% system is not robustly stable in face of the parametric 
uncertainty present in the plant as indicated by both /i-analysis and the small gain test. Though 
these tests are both conservative for real parametric uncertainty, eigenvalue analysis shows 
conclusively that the system is not robustly stable. 
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Chapter 7 


Hoo Design 


In the last Chapter we designed a compensator that meets performance specifications within 
allowable control effort specifications by minimizing the /f 2 norm of the transfer function from 
exogenous inputs to the error vector. This Chapter solves a similar problem, only this time 
performance is measured with an H norm. The H ^ norm is defined in the frequency domain 
for a stable transfer function matrix as 


ll<?(»)l|oo 


sup &[G(ju)] 


Mi 
IS. IMh 


(7.1) 


Using this norm, an H optimization design technique will result in the compensator that 
minimizes the supremum over all frequencies of the maximum singular value of the transfer 
function of interest. 

Recall that the disturbances acting on the SRMS are caused by Shuttle attitude thruster 
firings. The Shuttle may fire its thrusters in pulses every 80 milliseconds, or for steps lasting as 
long as 200 seconds. Such a firing profile has the ability to excite the system at any frequency 
up through its bandwidth. This makes the H ^ norm a natural selection for optimization. We 
would like to reduce the worst case gain for disturbances of the SRMS at all frequencies. 

There are many parallels between the // 2 and H a o approaches. Development of the H 
theory in Section 7.1 flows naturally from the work of last Chapter. As in the // 2 case, we 
arrive at two algebraic Riccati equations. The H 0 0 Riccati equations can be thought of as 
those that arise from a linear quadratic differential game between two opposing players. One 
player chooses the disturbance to maximize the norm of the output while the opposing player 
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Figure 7.1. Plant With Compensator 

chooses the control input to minimize it. The work in this Section follows that of Reference [17]. 
Though not presented, the H ^ design technique can also be derived by recasting the system as 
a model-matching problem via the parametrization of all stabilizing controllers. An exposition 
of this technique is presented in Reference [18]. 

Section 7.2 analyzes the results of the Hoo design using the same criteria as in previous 
Chapters. The singular values from disturbance to control effort and disturbance to the end 
effector states are examined and compared to the open-loop system. To highlight the differences 
in optimizing the H 2 norm versus the norm, a comparison of these two designs is presented 
as well. As before, a time domain simulation of the same jet firing pattern is presented. 
Finally, stability robustness of the closed-loop H system is examined in face of the parametric 
uncertainty. By choosing a different transfer function to optimize, namely the output of the 
plant uncertainties to the input of the plant uncertainties, we can determine if any H 0 0 controller 
would be able to robustly stabilize the plant. 

7.1 Hoc Theory 

For the H a 0 design we will use the same general two-block diagram as shown in Figure 7.1. 
Define the error vector the same as for the H 2 design, namely 

y 

e = 

u 

Using the same notation as in the last Chapter, write the transfer function from the exogenous 
inputs to the error vector as 
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e = P t (P,K)d (7.2) 

The H^c optimal compensator is the compensator that minimizes the supremum over all 
frequencies of the maximum singular value of Tt (P, K) for stabilizing K. This can be written 
as 


||^(^^)lloo = sup <r(r f (P,K)(ju)) (7 

K stabilizing K * 

In practice however, contrary to the H 2 problem, //<» optimal controllers are more difficult to 
characterize than suboptimal ones. We settle for an admissible K such that \\Tt\P^ ^)|loo < 7* 
Before characterizing the //<*, suboptimal controllers, consider the transfer function matrix 


mm 

stabilizing 


of Figure 7.1 to be in the same form as the H 2 problem. That is 


*p 


A 

B , 

B* 


x r 

e 

= 

c, 

0 

D\i 


d 

y 


c, 

D<i\ 

0 


u 


with 

D n = [0 if and D 21 = [0 /] 


(7.4) 


He# theory requires that the following must hold true for the system: 


(a) is stabilizable and detectable. 


(b) (j 4, is stabilizable and detectable. 


(c) D 12 7 [C\ Dw] = [0 /]. 


(d) 


Bi 
Di i 


£>21 7 = 


The weights on the states in the error vector, e, must be chosen to insure (a) is met. As 
shown in Chapter 4, (b) is known to be true for SSAD. To meet conditions (c) and (d) it will be 
necessary to apply the same transformations on the system used in the H 2 problem as shown 
in Section 6.1. Implicit in the given state-space realization for Figure 7.1 is that Dy\ = 0 
and D 22 =0. In a general H 0 0 problem these conditions can be relaxed, though it would 
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complicate the formulas presented here considerably. A treatment of the more general problem 
can be found in Reference [19]. 

The Hoc suboptimal problem is to find an admissible K such that \\Ff (P, /Olloc < 7. Its 
solution involves two Hamiltonian matrices as in the H 2 problem 


J 


00 


A p 7 -2 B 1 B-\ r — B 2 B 2 1 
-Ci'Ci -A p r 

A p r T " 2 Ci r C,-C 2 r C 2 
-BiV -A p 


(7.5) 

(7.6) 


These Hamiltonians are remarkably similar to the i/2 formulation. The main difference is 
that the additional terms in the upper-right entry of both F and Joe make the two Hamil- 
tonians no longer sign definite. Therefore we cannot guarantee that either F 0 0 or /oo has a 
Riccati solution or that the solution is positive definite. In fact, this limitation is related to the 
existence of a Hoo suboptimal controller. 

In Reference [17], Doyle et al prove that there exists an admissible controller, K , such that 
\\FtX p , K )\\oo < 7 the following conditions hold true 

(0 F 0 o € dom(Ric) and X ^ := Ric(F 00 ) > 0. 

(ii) Joe € dom(Ric) and Yoo := Ric(/oo) > 0. 

(iii) p(X oo y oo ) < 7 2 where p(M) is the spectral radius of M. 

When all three of these conditions hold, one such suboptimal controller is 


where 


K Muh {s) 


Aoo 

Bn 

Coo 

0 


(7.7) 


Aoo = Ap+ 7 ~ 2 B,B } r X ao + S 2 Coo- SocC 2 (7.8) 

Bn = (/-7“*no^oo)"VooC3 r 

Cn = -B^’Xn 


99 



Conditions (i) through (iii) suggest the following method, known as 7 iteration, to calculate 
an H a 0 ‘optimal’ controller: select a positive number 7; test conditions (i) through (iii); increase 
7 if failed or decrease 7 if passed; repeat until within a desired tolerance. 

To illustrate the relationship between the and H2 solutions, notice that as 7 — ► 00, 
— ► X2y etc., and K — ► K2 • This can be seen directly from the two sets of Hamiltonians 
in the H a Q and H2 problem formulations; as 7 — ► 00 the H ^ Hamiltonians become the H2 
Hamiltonians. 


7.2 H 0 o Results 

As in the H2 problem, the design parameters of the H a 0 problem are the choice of weights 
on the contents of the error vector, namely y and u. In design, select the C\ weights for the H 0 Q 
problem as before when designing both the H2 and LQR controllers. That is, multiply each 
channel corresponding to an attitude by an order of magnitude to account for the difference 
between position and attitude singular values of the open-loop system. This is to ameliorate 
the difference between a one inch error and a one radian error. 

The weights on the control effort, D\ 2 ) are treated as the main tuning parameter. The 
weight on each control channel is increased or decreased appropriately until every control effort 
singular value does not exceed the corresponding joint rate limit. The final weights of the H 0 G 
design need not correspond to the weights of the H2 design since a different norm is being 
optimized. 

Section 7.2.1 examines the control effort and performance singular values of the H 0 0 design. 
The performance singular values are compared not only to the open-loop plant but also to the 
H2 design in order to highlight the differences in the two design techniques. A time domain 
simulation driven by the same jet firing pattern used on the other designs is included as well to 
help interpret the singular values. Section 7.2.2 examines the stability robustness of the H ^ 
design. By choosing a different transfer function’s H ^ norm to minimize, we can determine if 
it is possible to design a robustly stable compensator. 

7.2.1 H oo Control Effort and Performance 

Figure 7.2 shows the singular values from disturbance to control effort after tuning the 
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Figure 7.2. Control Effort Singular Values of the H System 

weights on the error vector. As before, each control channel weight residing in 2 is fine tuned 
until the scaled singular value falls just below one. 

In the H 2 solution, the controller did not make full use of every control channel, specifically 
elbow pitch and wrist pitch. For the design a different norm is designed for, the H norm, 
that is the supremum over all frequencies of the maximum singular value, not the area under 
the singular value curves as in the H 2 case. The H oo controller is able to make use of each 
control to its full potential since it is penalizing the control channels in a different way. 

Figure 7.3 presents the singular values from disturbance to end effector position and attitude. 
Included on the same plots is the corresponding information for the open-loop system. The 
Hao design is able to meet the performance goals of less than 1 inch excursion in position and 
less than 0.01 radians of rotation excursion without exceeding the joint rate limits. 

Figure 7.4 compares the singular values from disturbance to end effector position and at- 
titude for both the H ^ and H 2 systems. Because the H ^ design is trying to minimize the 
Hcq norm, it has sacrificed some performance at 2 rps so that the maximum singular value can 
be brought down. The difference between the two designs is not as dramatic as can be seen 
in other systems since the H 2 design does not contain any large resonances, or ‘hills,’ in its 


101 


Magnitude (PH: dashed H-inf: solid) Magnitude (PH: dashed H-inf: solid) 



Frequency (radians/second) 



Figure 7.3. Performance Singular Values of the H a 0 and Position Hold Systems 
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Figure 7.4. Performance Singular Values of the Hoc and H2 Systems 
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maximum singular value to begin with, it is already flat. The H^q design lowers the maximum 
singular value for both position and attitude at the expense of some of the other directions. 

The presence of the control effort in the error vector acts like an H*i constraint on the H ^ 
design limiting the bandwidth of the closed-loop system. The H oo design sans control effort in 
the error vector would have a theoretically infinite bandwidth; theoretical because even if we 
accept exceeding the joint rate limits to increase the bandwidth, the actual system will run into 
physical constraints on the control effort. 

As with the previous designs, Figures 7.5 and 7.6 present a time domain simulation. As 
before, a roll command is given for a full 10 seconds immediately followed by a 10 second pitch 
command and finally a 10 second yaw command. These Figures include the responses of the 
H oo system as well as both the open-loop and the H 2 system. 

From Figure 7.5 it is apparent that as in the previous designs, the Hoq compensator’s 
X direction has poorer disturbance rejection than the Y and Z directions. For the attitude 
directions, roll disturbance rejection is better than the other axes as shown in Figure 7.6. 
Compared to the H 2 system, the H ^ system is better in some directions and slightly worse 
in others. As expected, the worst case direction is better for the H ^ design, namely X for 
the position directions and pitch for the attitude directions. These results confirm the singular 
value plots that show the H design offers a better worst case direction at the expense of 
some of the other directions. Nevertheless, both controllers are adequate to achieve our desired 
performance objectives. 

7.2.2 ff 0 0 Stability Robustness 

Figure 7.7 shows the results of applying both the small gain test and /i-analysis to the closed- 
loop H 0 o system. As with every other design, the H system fails both stability robustness 
tests. Experience with the other designs in previous Chapters show that these tests may be 
poor indicators of the system’s true robust stability. 

Eigenvalue analysis of both the open-loop system and the H 2 design show conclusively 
that these two systems are not robustly stable in face of the parametric uncertainty present 
in the model. However, for the LQR design the closed-loop system is stable for four sets of 
the parameters at their endpoints. Though this is not undisputable proof, it suggests that it 
is possible the LQR closed-loop system is indeed robustly stable. This is a crude method for 
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End Effector X Position 



Figure 7.5. Translational Response of PH, # 2 , and H ^ Systems to Attitude Thruster Com- 
mands 
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End Effector Attitude: Roll 



Figure 7.6. Rotational Response of PH, H 2 , and H ^ Systems to Attitude Thruster Commands 
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H-inf: Small Gain Test (solid) Mu-Analysis (dashed) 



Figure 7.7. Robust Stability Analysis of the H 0 0 System 

examining robust stability; however the nature of our system leaves few options. Though the 
Hoo system fails both stability robustness tests, let us examine the eigenvalues of the closed-loop 
system at certain endpoints of the parameters as before. 

Figure 7.8 presents the near pole locations for the nominal closed-loop system. The 
following four Figures show the closed-loop poles for the four sets of perturbed parameters. 
Figure 7.9 places both the Km a and the Ka at their minimum values. Figure 7.10 places both 
sets of parameters at their maximum values. Figure 7.11 places the Km A at their minimum 
values and the Ka at their maximum values. Finally, Figure 7.12 considers the last permutation 
of the Km A at their maximum values and the Ka at their minimum values. Evaluating these 
sets of parameters for the closed-loop H ^ design, as performed for the LQR and H 2 systems, 
shows conclusively that the system is indeed not robustly stable. 

Aside from the nominal plant, the closed-loop system is only stable for system 2 shown in 
Figure 7.10. This is also the case for the H 2 system considered in Chapter 6. Similar to the 
H<i plant, it is not possible to draw any conclusions for the H ^ system about certain regions 
of operation that are causing instability since stability is a problem for several configurations. 

If the stability problem was caused by neglected dynamics, a solution to robust stability 
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Figure 7.8. Closed-Loop Poles of the Nominal Hoc System 



Figure 7.9. Closed- Loop Poles of the Off-Nominal //<*, System: 1 
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Figure 7.10. Closed-Loop Poles of the Off-Nominal H ^ System: 2 
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Figure 7.11. Closed-Loop Poles of the Off-Nominal H ^ System: 3 
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Figure 7.12. Closed-Loop Poles of the Off-Nominal H System: 4 

would be to reduce the bandwidth of the system. Unfortunately, with parametric uncertainty 
the system poles simply aren’t where the controller is expecting them. Reducing the bandwidth 
will not solve the problem. Chapter 4 shows that the open-loop system itself is not robustly 
stable in face of the parametric uncertainty present. Therefore even a zero as the transfer 
function for the controller will not stabilize the plant. However, using the H 0 0 design process 
it is possible to design for robust stability directly to see if it is achievable for the SSAD plant. 

Recall the general three block formulation shown in Figure 7.13 and its closed-loop repre- 
sentation shown in Figure 7.14. G(s) is the closed-loop transfer function which combines the 
dynamics of the plant and the compensator. Recall that the closed-loop transfer function, G(s), 
can be divided into a set of four transfer functions as follows, 


e G2l(s) ^22 (^) 

Chapter 4 shows that if ||C?i 1 (-s) ||oo < 1? robust stability is achieved via the small gain test. 
Therefore, the H <*, design process will design for robust stability directly with no consideration 
for performance if the input and error vectors are chosen such that the transfer function from d 


(7.9) 
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Figure 7.13. Standard Form for a Compensated Plant with Uncertainty 



Figure 7.14. Closed-Loop Transfer Function 
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to e is Gn(s). The H ^ design methodology will try to minimize the H ^ norm of this transfer 
function. If it can bring the maximum singular value below one, by the small gain test robust 
stability is guaranteed. 

Before proceeding on this robust stability design, there is a subtlety to consider. In order 
to insure that the rank conditions outlined in Section 7.1 are not violated, the exogenous input 
vector, d, must include not only the output of the uncertainty block 77 , which replaces the 
process noise, but must also include the sensor noise. Similarly, the error vector e must include 
u as well as e. These are conditions (c) and (d) from Section 7.1. Hence, 
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e = 
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u 

. 
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The subtlety is that if u is penalized by being included in the error vector, the ability to achieve 
robust stability is also penalized since it may require large amounts of control. Remember 
that for now, performance is not considered at all, the only concern is to see if any controller 
can robustly stabilize the system 1 . Therefore, the design process to achieve a robustly stable 
compensator is to decrease the penalty on u until the maximum singular value of the closed-loop 
system falls below unity. 

After decreasing the weight on the control channels to 10 10 , the plant starts to numerically 
fail the rank tests. Unfortunately, at this point the plant still fails to achieve robust stability. 
Figure 7.15 presents the results of both the small gain test and /i-analysis on this ‘Gu’ H ^ 
design. Though robust stability is not achieved, the curves are at their lowest levels out of 
every design, including the open- loop system. However, do not think of this system as less 
robustly unstable since robust stability is a binary test that this design fails. Even this attempt 
at robust stability has come at enormous cost. 

Recall that this design gives basically no regard for performance whatsoever. Figure 7.16 
shows the scaled singular values from disturbance to control effort. The joint rate limits have 
been exceeded by almost five orders of magnitude. Figure 7.17 presents the singular values 
from disturbance to end effector position and attitude. Clearly, the price is quite high in the 
effort to achieve robust stability. Unfortunately the goal is not even achieved. 

’The LCJft system is robustly stable, but it requires full-state feedback 
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Figure 7.15. Robust Stability Analysis of the c Gn ’ H a 0 System 



Figure 7.16. Control Effort Singular Values of the ‘Gn’ H ^ System 
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Figure 7.17. Performance Singular Values of the ‘<?n’ H oo System 
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7.3 Summary 

This Chapter presents a method for finding H „ 0 optimal controllers that minimize, to a 
desired tolerance, the norm of the transfer function from disturbance to error vector. This 
problem is closely related to the Hi design process which is minimizing the Hi norm of the 
same transfer function. Both techniques involve a two Riccati equation solution process. Both 
remove the requirement for full state feedback uncorrupted by noise called for in the LQR 
design. 

With the Hao approach, on the nominal plant we are able to achieve performance better 
than the Hi design. Performance is better in the sense that the worst case direction is of 
less magnitude than the Hi design; however, this came at the expense of some of the other 
directions. Nevertheless, both performance objectives are met without exceeding the limits on 
the control effort. 

Unfortunately the closed-loop Hoc system, like the Hi system, is not robustly stable in face 
of the parametric uncertainty present in the plant. As before both ^-analysis and the small 
gain test as well as actual eigenvalue analysis confirm the results. In an effort to achieve a 
robustly stable design, the problem is recast to minimize the Hca norm of the Gn transfer 
function. This essentially abandons all performance desires and tries only to achieve robust 
stability. This process fails to achieve robust stability as well. 
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Chapter 8 


Conclusions 


Three regulator designs were developed to improve the performance of the Space Shuttle 
Remote Manipulator System (SRMS). Designs focused on the Position Hold function of the 
SRMS. Position Hold is universal to normal operations of the SRMS. In Chapter 2, the SRMS 
system was examined in greater detail. A linear model of the nonlinear plant was developed 
in Chapter 3. Two of the significant nonlinearities were modelled as parametric uncertainties 
in an effort to capture their effect on the system. The linear model was compared to a higher 
fidelity nonlinear model in the time domain. 

In Chapter 4, the characteristics of the open-loop plant were explored. Because of the 
numerical problems associated with six integrator poles, subsequent regulator designs used 
the current Position Hold system as the open-loop plant. In this manner, the current system 
moved the integrator poles off the ju;-axis. In effect, the new designs operate as stability and 
performance augmentation around the current system. Model reduction was applied to the new 
open-loop plant. 

The performance and control effort of the open-loop system were presented in Chapter 4 as 
a baseline with which to compare subsequent regulator designs. The performance and control 
effort criteria were analyzed in the frequency domain by looking at three sets of singular values: 
disturbance to control effort, disturbance to end effector position, and disturbance to end effec- 
tor attitude. A typical Shuttle attitude control jet firing pattern, which acts as a disturbance 
on the system, was simulated in the time domain. 

Also in Chapter 4, a sufficient condition for robust stability was developed using the small 
gain theorem with an H a 0 norm. The structured singular value, /i, was defined and shown to 
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provide a necessary and sufficient condition for robust stability. In calculation, /i-analysis is 
reduced to a sufficient condition, but it is less conservative than the small gain test. Unfortu- 
nately these tests are conservative for lightly damped systems with real parametric uncertainties 
— the exact description of the SRMS model. Application of these tests on the current system 
determined it to be unstable. As confirmation, the eigenvalues of the system were calculated 
for certain values of the uncertain parameters. For conditions corresponding to low amplitude, 
slow motion, the current system was found to indeed be unstable. This corresponds to the slow, 
low amplitude limit cycling observed in the actual system. 

A Linear Quadratic Regulator (LQR) was designed in Chapter 5. Its performance and 
control effort were a considerable improvement over the current system. Though the LQR 
failed both the small gain test, and /i-analysis, actual eigenvalue analysis seemed to indicate 
that the system may be robustly stable. Unfortunately this cannot be proved unequivocally. 
Since full-state feedback is not available on the SRMS, two model based compensators were 
designed. 

In Chapter 6 the H 2 technique and in Chapter 7 the H 0 0 technique were presented. There 
are many parallels between these two techniques. In both, an error vector is constructed from 
the controls and some linear combination of the states. As the name implies, the H 2 technique 
designs an optimal compensator that minimizes the H 2 norm of a transfer matrix using only the 
system measurements. Similarly, the technique minimizes the Hoc norm. Both designs were 
able to improve the performance of the nominal system considerably. However, neither design 
achieved robust stability. Not only did both designs fail the small gain test and /i-analysis, but 
actual eigenvalue analysis showed the systems were indeed not robustly stable. 

In an effort to achieve a robustly stable design, In Chapter 7 the problem was recast to 
minimize the //oo norm of the Gn transfer function. By the small gain test, if the H oo norm 
of this transfer function falls below one, robust stability is assured. This essentially abandons 
all performance desires and tries only to achieve robust stability. Unfortunately, this process 
fails to achieve robust stability as well. This modified design did establish the trade-off present 
in the pursuit of robust stability at the expense of performance and control effort. 

If the stability problem was caused by neglected dynamics, a solution to robust stability 
would be to reduce the bandwidth of the system. This could be accomplished by either further 
penalizing the control effort or reducing the weights on the end effector states, ie . , demand 


117 



less performance. Unfortunately, with parametric uncertainty the low frequency system poles 
simply aren’t where the controller is expecting them. Reducing the bandwidth will not solve 
the problem. Considering the open-loop system itself is not robustly stable, even a zero for the 
compensator will not make the system robustly stable. 

A possible extension for this thesis would be to gain schedule the controllers over certain 
regions of the nonlinearities. In this manner, the uncertainty of the parameters would be reduced 
allowing a family of controllers to be robustly stable over their respective regions. A ‘real /i- 
analysis’ technique would aid the design process allowing for the removal of conservatism in the 
gain scheduled controllers. Additionally, the model can be modified to incorporate acceleration 
sensors or optical measurements to provide for different measurements when NASA chooses a 
new sensor suite. 

The SRMS model suffers from lightly damped poles that have large real parametric un- 
certainty. To achieve a single robustly stable design a ‘real ^-synthesis’ technique is needed. 
This technique must not only use the information of the structure of the uncertainty, but also 
account for exclusively real perturbations. The SRMS plant is an ideal candidate for a real 
^-synthesis technique. 
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